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Background Of The Invention 

A common technique for cloning receptors is to use nucleic acid hybridization 
technology to identify receptors which are homologous to other, known receptors. For 
instance, originally the cloning of seven transmembrane domain G protein-coupled receptors 
(GCR) depended on the isolation and sequencing of the corresponding protein or the use of 
expression cloning techniques. However, when sequences for these receptors became 
available, it was apparent that there were significant sequence homologies between these 
receptors. This technology, since it does not require that the ligand of the receptor have 
been identified, has resulted in the cloning of a large number of "orphan receptors", which 
have no known ligand and often whose biological function is obscure. Receptors of all types 
comprise this large family. Known orphan receptors include the nuclear receptors COUP- 
TF1/EAR3, COUP-TF2/ARP 1 , EAR-1, EAR-2, TR-2, PPAR1, HNF-4, ERR-1, ERR-2, 
NGFIB/Nur77, ELP/SF-1 and MPL (Parker et al, supra, and Power et al. (1992) TIBS 
13:318-323). A large number of orphan receptors have been identified in the EPH family 
(Hirai et al (1987) Science 238:1717-1720). HER3 and HER4 are orphan receptors in the 
epidermal growth factor receptor family (Plowman et al. (1993) Proc. Natl. Acad. Sci. USA 
90:1746-1750). ILA is a newly identified member of the human nerve growth factor/tumor 
necrosis factor receptor family (Schwarz et al. (1993) Gene 134:295-298). IRRR is an 
orphan insulin receptor-related receptor which is a transmembrane tyrosine kinase (Shier et 
al. (1989) J. Biol Chem 264:14606-14608). Several orphan tyrosine kinase receptors have 
been found in Drosophila (Perrimon (1994) Curr. Opin. Cell Biol 6:260-266). The 
importance of identifying ligands for orphan receptors is clear; it opens up a wide area for 
research in the area of drug discovery. 

Qne large subgroup of orphan receptors, as alluded to above, are found in the G 
protein coupled receptor family. Approximately 100 such receptors have been identified by 
function and these mediate transmembrane signaling from external stimuli (vision, taste and 
smell), endocrine function (pituitary and adrenal), exocrine function (pancreas), heart rate, 
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lipolysis, and carbohydrate metabolism. Structural and genetic similarities suggest that G 
protein-coupled receptor superfamily can be subclassified into 15 ve distinct groups: (i) amine 
receptors (serotonin, adrenergic, etc.); (ii) small peptide hormone (somatostatin, TRH, etc.); 
(iii) large peptide hormone (LH-CG, FSH, etc.); (iv) secretin family; and (v) odorant 
receptors (Buck L. and Axel, R. (1991) Cell 65:175-187), with orphan receptors apparently 
occurring in each of the sub-families. 

Previous work describes the expression of recombinant mammalian G 
protein-coupled receptors as a means of studying receptor function as a means of identifying 
agonists and antagonists of those receptors. For example, the human muscarinic receptor 
(HM1) has been functionally expressed in mouse cells (Harpold et al. US Pat. 5,401,629). 
The rat VI b vasopressin receptor has been found to stimulate phosphotidy. inositol hydrolysis 
and intracellular Ca2+ mobilization in Chinese hamster ovary cells upon agonist stimulation 
(Lolait et al. (1995) Proc Natl AcadSci. USA 92:6783-6787). Likewise, the C5a receptor {to 
be completed} These types of ectopic expression studies have enabled researchers to study 
receptor signalling mechanisms and to perform mutagenisis studies which have been useful 
in identifying portions of receptors that are critical for ligand binding or signal transduction. 

Experiments have also been undertaken to express functional G protein coupled 
receptors in yeast cells. For example, U.S. Patent 5,482,835 to King et al. describes a 
transformed yeast cell which is incapable of producing a yeast G protein a subunit, but which 
has been engineered to produce both a mammalian G protein a— subunit and a mammalian 
receptor which is "coupled to" (i.e., interacts with) the aforementioned mammalian G protein 
a-subunit. Specifically, U.S. Patent 5,482,835 reports expression of the human beta-2 
adrenergic receptor (P2AR), a seven transmembrane receptor (STR), in yeast, under control 
of the GAL1 promoter, with the P2AR gene modified by replacing the first 63 base pairs of 
coding sequence with 1 1 base pairs of noncoding and 42 base pairs of coding sequence from 
the STE2 gene. (STE2 encodes the yeast a-factor receptor). The Duke researchers found that 
the modified P2AR was functionally integrated into the membrane, as shown by studies of 
the ability of isolated membranes to interact properly with various known agonists and 
antagonists of p2AR. The ligand binding affinity for yeast-expressed p2AR was said to be 
nearly identical to that observed for naturally produced P2AR. 

U.S. Patent 5,482,835 describes co-expression of a rat G protein a-subunit in the 
same cells, yeast strain 8C, which lacks the cognate yeast protein. Ligand binding resulted in 
G protein-mediated signal transduction. U.S. Patent 5,482,835 teaches that these cells may 
be used^in screening compounds for the ability to affect the rate of dissociation of Got from 
Gpy in a cell. For this purpose, the cell further contains a pheromone-responsive promoter 
(e.g. BAR1 or FUS1), linked to an indicator gene (e.g. HIS3 or LacZ). The cells are placed in 
multi-titer plates, and different compounds are placed in each well. The colonies are then 



scored for expression of the indicator gene. 

Summary of the Invention 

The present invention relates to a rapid, reliable and effective assay for screening and 
identifying pharmaceutically effective compounds that specifically interact with and 
modulate the activity of a cellular receptor or ion channel. The subject assay enables rapid 
screening of large numbers of polypeptides in a library to identifying those polypeptides 
which agonize or antagonize receptor bioactivity. In general, the assay is characterized by 
the use of a library of recombinant cells, each cell of which include (i) a target receptor 
protein whose signal transduction activity can be modulated by interaction with an 
extracellular signal, the transduction activity being able to generate a detectable signal, and 
(ii) an expressible recombinant gene encoding an exogenous test polypeptide from a 
polypeptide library. By the use of a variegated gene library, the mixture of cells collectively 
express a variegated population of test polypeptides. In preferred embodiments, the 
polypeptide library includes at least 10 3 different polypeptides, though more preferably at 
least 10 5 , 10 6 , or 10 7 different (variegated) polypeptides. The polypeptide library can be 
generated as a random peptide library, as a semi-random peptide library (e.g., based on 
combinatorial mutagenesis of a known ligand), or as a cDNA library. 

The ability of particular constituents of the peptide library to modulate the signal 
transduction activity of the target receptor can be scored for by detecting up or down- 
regulation of the detection signal. For example, second messenger generation via the 
receptor can be measured directly. Alternatively, the use of a reporter gene can provide a 
convenient readout. In any event, a statistically significant change in the detection signal can 
be used to facilitate isolation of those cells from the mixture which contain a nucleic acid 
encoding a test polypeptide which is an effector of the target receptor. 

By this method, test polypeptides which induce receptor signaling can be identified. 
If the test polypeptide does not appear to directly induce the activity of the receptor protein, 
the assay may be repeated and modified by the introduction of a step in which the 
recombinant cell is first contacted with a known activator of the target receptor to induce the 
signal transdution pathways from the receptor. In one embodiment, the test polypeptide is 
assayed for its ability to antagonize, e.g., inhibit or block the activity of the activator. 
Alternatively, the assay can score for peptides from the peptide library which potentiate the 
induction response generated by treatment of the cell with a known activator. As used 
herein, an "agonist" refers to agents which either induce activation of receptor signalling 
pathways, e.g., such as by mimicking a ligand for the receptor, as well as agents which 
potentiate the sensitivity of the receptor to a ligand, e.g., lower the concentrations of ligand 
required to induce a particular level of receptor-dependent signalling. 


In one embodiment of the present invention the reagent cells express the receptor of 
interest endogenously. In yet other embodiments, the cells are engineered to express a 
heterlogous target receptor protein. In either of these embodiments, it may be desirable to 
inactivate one or more endogenous genes of the host cells. For example, certain preferred 
embodiments in which a heterlogous receptor is provided utilize host cells in which the gene 
for the homologous receptor has been inactivated. Likewise, other proteins involved in 
transducing signals from the target receptor can be inactivated, or complemented with an 
ortholog or paralog from another organism, e.g., yeast G protein subunits can be 
complemented by mammalian G protein subunits in yeast cells also engineered to express a 
mammalian G protein coupled receptor. Other complementations include, for example, 
expression of heterologous MAP kinases or erk kinases, MEKs or MKKs (MAP kinase 
kinases), MEKKs (MEK kinases), ras, raf, STATs, JAKs and the like. 

The receptor protein can be any receptor which interacts with an extracellular 
molecule (i.e. hormone, growth factor, peptide) to modulate a signal in the cell. To illustrate 
the receptor can be a cell surface receptor, or in other embodiments can be an intracellular 
receptor. In preferred embodiments, the receptor is a cell surface receptor, such as: a receptor 
tyrosine kinase, e.g., an EPH receptor; an ion channel; a cytokine receptor; an multisubunit 
immune recognition receptor, a chemokine receptor; a growth factor receptor, or a G-protein 
coupled receptor, such as a chemoattracttractant peptide receptor, a neuropeptide receptor, a 
light receptor, a neurotransmitter receptor, or a polypeptide hormone receptor. 

Preferred G protein coupled receptors include alA-adrenergic receptor, alB- 
adrenergic receptor, a2-adrenergic receptor, a2B-adrenergic receptor, pi -adrenergic 
receptor, p2-adrenergic receptor, P3 -adrenergic receptor, ml acetylcholine receptor (AChR), 
m2 AChR, m3 AChR, m4 AChR, m5 AChR, Dl dopamine receptor, D2 dopamine receptor, 
D3 dopamine receptor, D4 dopamine receptor, D5 dopamine receptor, Al adenosine receptor, 
A2b adenosine receptor, 5-HT la receptor, 5-HTlb receptor, 5HTl-like receptor, 5-HTld 
receptor, 5HT ld-like receptor, 5HTld beta receptor, substance K (neurokinin A) receptor, 
fMLP receptor, fMLP-like receptor, angiotensin II type 1 receptor, endothelin ETA receptor, 
endothelin ETB receptor, thrombin receptor, growth hormone-releasing hormone (GHRH) 
receptor, vasoactive intestinal peptide receptor, oxytocin receptor, somatostatin SSTR1 and 
SSTR2, SSTR3, cannabinoid receptor, follicle stimulating hormone (FSH) receptor, leutropin 
(LH/HCG) receptor, thyroid stimulating hormone (TSH) receptor, thromboxane A2 receptor, 
platelet-activating factor (PAF) receptor, C5a anaphylatoxin receptor, Interleukin 8 (IL-8) IL- 
8RA, IL-8RB, Delta Opioid receptor, Kappa Opioid receptor, mip-l/RANTES receptor, 
Rhodopsin, Red opsin, Green opsin, Blue opsin, metabotropic glutamate mGluRl-6, 
histamine H2 receptor, ATP receptor, neuropeptide Y receptor, amyloid protein precursor 
receptor, insulin-like growth factor II receptor, bradykinin receptor, gonadotropin-releasing 
hormone receptor, cholecystokinin receptor, melanocyte stimulating hormone receptor 


receptor, antidiuretic hormone receptor, glucagon receptor, and adrenocorticotropic hormone 
II receptor. 

Preferred EPH receptors inlcude eph, elk, eck 9 sek, mek4, hek, hek2, eek, erk, tyrol, 
tyro4, tyro5 9 tyro6, tyroll, cek4, cekS, cek6, cek7, cek8, cek9, ceklO, bsk, rtkl, rtk2, rtk3, 
mykl, myk2, ehkl, ehk2, pagliaccio, htk, erk and nuk receptors. 

As set forth below, no matter which stmctural/function class to which the target 
receptor may belong, the subject assay is amenable to identifying ligands for an otherwise 
orphan receptor. 

In those embodiments wherein the target receptor is a cell surface receptor, it will be 
desirable for the peptides in the library to express a signal sequence to ensure that they are 
processed in the appropriate secretory pathway and thus are available to interact with 
receptors on the cell surface. 

With respect to a detection signal generated by signal transduction, certain of the 
preferred embodiments measure the production of second messengers to determine changes 
in ligand engagement by the receptor. In preferred embodiments, changes in GTP hydrolysis, 
calcium mobilization, or phospholipid hydrolysis can be measured. 

In other preferred embodiment, the host cells harbors a reporter construct containing a 
reporter gene in operative linkage with one or more transcriptional regulatory elements 
responsive to the signal transductin activity of the receptor protein. Exemplary reporter 
genes include enzymes, such as luciferase, phosphatase, or p-galactosidase which can 
produce a spectrometrically active label, e.g., changes in color, fluorescence or luminescence, 
or a gene product which alters a cellular phenotype, e.g., cell growth, drug resistance or 
auxotrophy. In preferred embodiments: the reporter gene encodes a gene product selected 
from the group consisting of chloramphenicol acetyl transferase, beta-galactosidase and 
secreted alkaline phosphatase; the reporter gene encodes a gene product which confers a 
growth signal; the reporter gene encodes a gene product for growth in media containing 
aminotriazole or canavanine. 

The reagent cells of the present invention can be derived from any eukaryotic 
organism. In preferred embodiments the cells are mammalian cells. In more preferred 
embodiments the cells are yeast cells, with cells from the genera Saccharomyces or 
Schizosaccharomyces being more preferred. However, cells from amphibia (such as 
xenopus), avian or insect sources are also contemplated. The host cells can derived from 
primary cells, or transformed and/or immortalized cell lines. 

Brief Description of the Drawings 


Figure 1. Structures of pAAH5 and pRS-ADC. 


Figure 2. Schematic diagram of the structure of the plasmid resulting from insertion 
of random oligonucleotides into pADC-MF alpha. This plasmid expresses random peptides 
in the context of the MF alpha 1 signal and leader peptide. 

Figure 3. Schematic diagram of the structure of the plasmid resulting from insertion 
of random oligonucleotides into pADC-MFa. This plasmid expresses random peptides in the 
context of the MFal leader and C-terminal CVIA tetrapeptide. 

Figure 4. Activity of a fusl promoter in response to signaling by human C5a 
expressed in autocrine strains of yeast. 

Detailed Description of the Invention 

Proliferation, differentiation and death of eukaryotic cells are controlled by hormones, 
neurotransmitters, and polypeptide factors. These diffusible ligands allow cells to influence 
and be influenced by environmental cues. The study of receptor-ligand interaction has 
revealed a great deal of information about how cells respond to external stimuli, and this 
knowledge has led to the development of therapeutically important compounds. However, 
the rate at which receptors have been cloned has recently increased dramatically — existing 
families have been extended and new families recognized. In particular, the application of 
advanced cloning approaches has allowed the isolation of many receptors for which Hgands 
are initially unknown. These are commonly referred to in the art as "orphan" receptors, and 
several have subsequently proved to be important pharmacological targets. 

The present invention makes available a rapid, effective assay for screening and 
identifying pharmaceutically effective compounds that specifically interact with and 
modulate the activity of a cellular receptor or ion channel. The subject assay enables rapid 
screening of large numbers of polypeptides in a library to identifying those polypeptides 
which induce or antagonize receptor bioactivity. 

In general, the assay is characterized by the use of a mixture of recombinant cells to 
sample a variegated polypeptide library for receptor agonists or antagonists. As described 
with greater detail below, the reagent cells express both a target receptor protein capable of 
transducing a detectable signal in the reagent cell, and a test polypeptide for which 
interaction with the receptor is to be ascertained. Collectively, a culture of such reagent cells 
will prqvide a variegated library of potential receptor effectors and those members of the 
library which either agonize or antagonize the receptor function can be selected and identified 
by sequence. 

One salient feature of the subject assay is the enhanced sensitivity resulting from 


expression of the test polypeptide in a cell which also serves as a reporter for the desired 
receptor-ligand interaction. To illustrate, where the detectable signal resulting from receptor 
engagement by an agonist provides a growth signal or drug resistance, individual cells 
expressing polypeptides which agonize receptor function can be amplified and isolated from 
a library culture. 

Accordingly, the present invention provides a convenient format for discovering 
drugs which can be useful to modulate cellular function, as well as to understand the 
pharmacology of compounds that specifically interact with cellular receptors or ion channels. 
Moreover, the subject assay is particularly amenable to identifying ligands, natural or 
artifical, for orphan receptors. 

Before further description of the invention, certain terms employed in the 
specification, examples and appended claims are, for convenience, collected here. 

As used herein, "recombinant cells" include any cells that have been modified by the 
introduction of heterologous DNA. Control cells include cells that are substantially identical 
to the recombinant cells, but do not express one or more of the proteins encoded by the 
heterologous DNA, e.g., do not include or express the reporter gene construct, receptor or test 
polypeptide. 

The terms "recombinant protein", "heterologous protein" and "exogenous protein" are 
used interchangeably throughout the specification and refer to a polypeptide which is 
produced by recombinant DNA techniques, wherein generally, DNA encoding the 
polypeptide is inserted into a suitable expression vector which is in turn used to transform a 
host cell to produce the heterologous protein. That is, the polypeptide is expressed from a 
heterologous nucleic acid. 

As used herein, "heterologous DNA" or "heterologous nucleic acid" include DNA 
that does not occur naturally as part of the genome in which it is present or which is found in 
a location or locations in the genome that differs from that in which it occurs in nature. 
Heterologous DNA is not endogenous to the cell into which it is introduced, but has been 
obtained from another cell. Generally, although not necessarily, such DNA encodes RNA 
and proteins that are not normally produced by the cell in which it is expressed. 
Heterologous DNA may also be referred to as foreign DNA. Any DNA that one of skill in 
the art would recognize or consider as heterologous or foreign to the cell in which is 
expressed is herein encompassed by heterologous DNA. Examples of heterologous DNA 
include, but are not limited to, DNA that encodes test polypeptides, receptors, reporter genes, 
transcriptional and translational regulatory sequences, selectable or traceable marker proteins, 
such as a protein that confers drug resistance. 

As used herein, "cell surface receptor" refers to molecules that occur on the surface of 
cells, interact with the extracellular environment, and transmit or transduce the information 


regarding the environment intracellularly in a manner that ultimately modulates transcription 
of specific promoters, resulting in transcription of specific genes. 

As used herein, "extracellular signals" include a molecule or a change in the 
environment that is transduced intracellularly via cell surface proteins that interact, directly 
or indirectly, with the signal. An extracellular signal or effector molecule includes any 
compound or substance that in some manner specifically alters the activity of a cell surface 
protein. Examples of such signals include, but are not limited to, molecules such as 
acetylcholine, growth factors and hormones, that bind to cell surface and/or intracellular 
receptors and ion channels and modulate the activity of such receptors and channels. 

As used herein, "extracellular signals" also include as yet unidentified substances that 
modulate the activity of a cellular receptor, and thereby influence intracellular functions. 
Such extracellular signals are potential pharmacological agents that may be used to treat 
specific diseases by modulating the activity of specific cell surface receptors. 

"Orphan receptors" is a designation given to a receptors for which no specific natural 
ligand has been described. 

As used herein, a "reporter gene construct" is a nucleic acid that includes a "reporter 
gene" operatively linked to a transcriptional regulatory sequences. Transcription of the 
reporter gene is controlled by these sequences. The activity of at least one or more of these 
control sequences is directly or indirectly regulated by the target receptor protein. The 
transcriptional regulatory sequences include the promoter and other regulatory regions, such 
as enhancer sequences, that modulate the activity of the promoter, or regulatory sequences 
that modulate the activity or efficiency of the RNA polymerase that recognizes the promoter, 
or regulatory sequences are recognized by effector molecules, including those that are 
specifically induced by interaction of an extracellular signal with the target receptor. For 
example, modulation of the activity of the promoter may be effected by altering the RNA 
polymerase binding to the promoter region, or, alternatively, by interfering with initiation of 
transcription or elongation of the mRNA. Such sequences are herein collectively referred to 
as transcriptional regulatory elements or sequences. In addition, the construct may include 
sequences of nucleotides that alter translation of the resulting mRNA, thereby altering the 
amount of reporter gene product. 

"Signal transduction" is the processing of chemical signals from the cellular 
environment through the cell membrane, and may occur through one or more of several 
mechanisms, such as phosphorylation, activation of ion channels, effector enzyme activation 
via guanine nucleotide binding protein intermediates, formation of inositol phosphate, 
activation of adenylyl cyclase, and/or direct activation (or inhibition) of a transcriptional 
factor. 

The term "modulation of a signal transduction activity of a receptor protein" in its 
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various grammatical forms, as used herein, designates induction and/or potentiation, as well 
as inhibition of one or more signal transduction pathways downstream of a receptor. 

Agonists and antagonists are "receptor effector" molecules that modulate signal 
transduction via a receptor. Receptor effector molecules are capable of binding to the 
receptor, though not necessarily at the. binding site of the natural ligand. Receptor effectors 
can modulate signal transduction when used alone, i.e. can be surrogate ligands, or can alter 
signal transduction in the presence ot the natural ligand, either to enhance or inhibit signaling 
by the natural ligand. For example, "antagonists" are molecules that block or decrease the 
signal transduction activity of receptor, e.g., they can competitively, noncompetitively, 
and/or allosterically inhibit signal transduction from the receptor, whereas "agonists" 
potentiate, induce or otherwise enhance the signal transduction activity of a receptor. The 
terms "receptor activator" and "surrogate ligand" refer to an agonist which induces signal 
transduction from a receptor. 

The term "substantially homologous", when used in connection with amino acid 
sequences, refers to sequences which are substantially identical to or similar in sequence, 
giving rise to a homology in conformation and thus to similar biological activity. The term is 
not intended to imply a common evolution of the sequences. 

Typically, "substantially homologous" sequences are at least 50%, more preferably at 
least 80%, identical in sequence, at least over any regions known to be involved in the 
desired activity. Most preferably, no more than five residues, other than at the termini, are 
different. Preferably, the divergence in sequence, at least in the aforementioned regions, is in 
the form of "conservative modifications". 

The term "autocrine cell", as used herein, refers to a cell which produces a substance 
which can stimulate a receptor located on or within the same cell as produces the substance. 
For example, wild-type yeast a and a cells are not autocrine. However, a yeast cell which 
produces both a-factor and a-factor receptor, or both a-factor and a-factor receptor, in 
functional form, is autocrine. By extension, cells which produce a peptide which is being 
screened for the ability to activate a receptor (e.g., by activating a G protein-coupled 
receptor) express the receptor are called "autocrine cells", though it might be more precise to 
call them "putative autocrine cells". Of course, in a library of such cells, in which a multitude 
of different peptides are produced, it is likely that one or more of the cells will be "autocrine" 
in the stricter sense of the term. 

The terms "protein", "polypeptide" and "peptide" are used interchangeably herein. 


/, Overview of Assay 

As set out above, the present invention relates to methods for identifying effectors of 
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a receptor protein or complex thereof. In general, the assay is characterized by the use of a 
library of recombinant cells, each cell of which include (i) a target receptor protein whose 
signal transduction activity can be modulated by interaction with an extracellular signal, the 
transduction activity being able to generate a detectable signal, and (ii) an expressible 
recombinant gene encoding an exogenous test polypeptide from a polypeptide library. By the 
use of a variegated gene library, the mixture of cells collectively express a variegated 
population of test polypeptides. 

The ability of particular constituents of the peptide library to modulate the signal 
transduction activity of the target receptor can be scored for by detecting up or down- 
regulation of the detection signal. For example, second messenger generation (e.g. GTPase 
activity, phospholipid hydrolysis, or protein phosphorylation) via the receptor can be 
measured directly. Alternatively, the use of a reporter gene can provide a convenient readout. 
In any event, a statistically significant change in the detection signal can be used to facilitate 
isolation of those cells from the mixture which contain a nucleic acid encoding a test 
polypeptide which is an effector of the target receptor. 

By this method, test polypeptides which induce the receptor's signaling can be 
screened. If the test polypeptide does not appear to induce the activity of the receptor 
protein, the assay may be repeated and modified by the introduction of a step in which the 
recombinant cell is first contacted with a known activator of the target receptor to induce 
signal transduction from the receptor, and the test polypeptide is assayed for its ability to 
inhibit the activity of the receptor, e.g., to identify receptor antagonists. In yet other 
embodiments, the peptide library can be screened for members which potentiate the response 
to a known activator of the receptor. In this respect, surrogate ligands identified by the 
present assay for orphan receptors can be used as the exogenous activator, and further peptide 
libraries screened for members which potentiate or inhibit the activating peptide. 
Alternatively, the surrogate ligand can be used to screen exogenous compound libraries 
(peptide and non-peptide) which, by modulating the activity of the identified surrogate, will 
presumably also similarly effect the native ligand's effect on the target receptor. In such 
embodiments, the surrogate ligand can be applied to the cells, though is preferably produced 
by the reagent cell, thereby providing an autocrine cell. 

In developing the recombinant cells assays, it was recognized that a frequent result of 
receptor-mediated responses to extracellular signals was the transcriptional acitivation or 
inactivation of specific genes after exposure of the cognate receptor to an extracellular signal 
that induces such activity. Thus, transcription of genes controlled by receptor-responsive 
transcriptional elements often reflects the activity of the surface protein by virtue of 
transduction of an intracellular signal. 

To illustrate, the intracellular signal that is transduced can be initiated by the specific 


interaction of an extracellular signal, particularly a ligand, with a cell surface receptor on the 
cell. This interaction sets in motion a cascade of intracellular events, the ultimate 
consequence of which is a rapid and detectable change in the transcription or translation of a 
gene. By selecting transcriptional regulatory sequences that are responsive to the transduced 
intracellular signals and operatively linking the selected promoters to reporter genes, whose 
transcription, translation or ultimate activity is readily detectable and measurable, the 
transcription based assay provides a rapid indication of whether a specific receptor or ion 
channel interacts with a test peptide in any way that influences intracellular transduction. 
Expression of the reporter gene, thus, provides a valuable screening tool for the development 
of compounds that act as agonists or antagonists of a cell receptor or ion channel. 

Reporter gene based assays of this invention measure the end stage of the above 
described cascade of events, e.g., transcriptional modulation. Accordingly, in practicing one 
embodiment of the assay, a reporter gene construct is inserted into the reagent cell in order to 
generate a detection signal dependent on receptor signaling. Typically, the reporter gene 
construct will include a reporter gene in operative linkage with one or more transcriptional 
regulatory elements responsive to the signal transduction activity of the target receptor, with 
the level of expression of the reporter gene providing the receptor-dependent detection signal. 
The amount of transcription from the reporter gene may be measured using any method 
known to those of skill in the art to be suitable. For example, specific mRNA expression may 
be detected using Northern blots or specific protein product may be identified by a 
characteristic stain or an intrinsic activity. 

In preferred embodiments, the gene product of the reporter is detected by an intrinsic 
activity associated with that product. For instance, the reporter gene may encode a gene 
product that, by enzymatic activity, gives rise to a detection signal based on color, 
fluorescence, or luminescence. 

The amount of expression from the reporter gene is then compared to the amount of 
expression in either the same cell in the absence of the test compound or it may be compared 
with the amount of transcription in a substantially identical cell that lacks the specific 
receptors. A control cell may be derived from the same cells from which the recombinant cell 
was prepared but which had not been modified by introduction of heterologous DNA, e.g., 
the encoding the test polypeptide. Alternatively, it may be a cell in which the specific 
receptors are removed. Any statistically or otherwise significant difference in the amount of 
transcription indicates that the test polypeptide has in some manner altered the activity of the 
specific receptor. 

In other preferred embodiments, the reporter or marker gene provides a selection 
method such that cells in which the peptide is a ligand for the receptor have a growth 
advantage. For example the reporter could enhance cell viability, relieve a cell nutritional 



requirement, and/or provide resistance to a drug. 

With respect to the target receptor, it may be endogenously expressed by the host cell, 
or it may be expressed from a heterologous gene that has been introduced into the cell. 
Methods for introducing heterologous DNA into eukaryotic cells are of course well known in 
the art and any such method may be used. In addition, DNA encoding various receptor 
proteins is known to those of skill in the art or it may be cloned by any method known to 
those of skill in the art. In certain embodiments, such as when an exogenous receptor is 
expressed, it may be desirable to inactivate, such as by deletion, a homologous receptor 
present in the cell. 

The subject assay is useful for identifying polypeptides that interact with any receptor 
protein whose activity ultimately induces a signal transduction cascade in the host cell which 
can be exploited to produce a detectable signal. In particular, the assays can be used to test 
functional ligand-receptor or ligand-ion channel interactions for cell surface-localized 
receptors and channels, and also for cytoplasmic and nuclear receptors. As described in more 
detail below, the subject assay can be used to identify effectors of, for example, G protein- 
coupled receptors, receptor tyrosine kinases, cytokine receptors, and ion channels, as well as 
steroid hormone receptors. In preferred embodiments the method described herein is used for 
identifying ligands for "orphan receptors" for which no ligand is known. 

In embodiments in which cell surface receptors are the assay targets, it will be 
desirable for each of the peptides of the peptide library to include a signal sequence for 
secretion, e.g., which will ensure appropriate transport of the peptide to the endoplasmic 
reticulum, the golgi, and ultimately to the cell surface so that it is able to interact with cell 
surface receptors. In the case of yeast cells, the signal sequence will transport peptides to the 
periplasmic space. 

Any transfectable cell that can express the desired cell surface protein in a manner 
such the protein functions to intracellularly transduce an extracellular signal may be used. 
The cells may be selected such that they endogenously express the target receptor protein or 
may be genetically engineered to do so. 

The preparation of cells which express the orphan FPRL1 receptor, a peptide library, 
and a reporter gene expression construct, are described. These cells have been used to 
identify a novel ligand for this receptor. The cells for the identification of receptor ligands 
and in drug screening assays to discover agents capable of modulating receptor activity. 

Any cell surface protein that is known to those of skill in the art or that may be 
identified by those of skill in the art may used in the assay. The cell surface protein may 
endogenously expressed on the selected cell or it may be expressed from cloned DNA. 
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//. Host Cells 

Suitable host cells for generating the subject assay include prokaryotes, yeast, or 
higher eukaryotic cells, especially mammalian cells. Prokaryotes include gram negative or 
gram positive organisms. Examples of suitable mammalian host cell lines include the COS-7 
line of monkey kidney cells (ATCC CRL 1651) (Gluzman (1981) Cell 23:175) CV-1 cells 
(ATCC CCL 70), L cells, CI 27, 3T3, Chinese hamster ovary (CHO), HeLa and BHK cell 
lines. 

If yeast cells are used, the yeast may be of any species which are cultivable and in 
which an exogenous receptor can be made to engage the appropriate signal transduction 
machinery of the host celL Suitable species include Kluyverei lactis, Schizosaccharomyces 
pombe, and Ustilaqo maydis; Saccharomyces cerevisiae is preferred. Other yeast which can 
be used in practicing the present invention are Neurospora crassa, Aspergillus niger, 
Aspergillus nidulans, Pichia pas tor is, Candida tropicalis, and Hansenula polymorpha. The 
term "yeast", as used herein, includes not only yeast in a strictly taxonomic sense, i.e., 
unicellular organisms, but also yeast-like multicellular fungi or filamentous fungi. 

The choice of appropriate host cell will also be influenced by the choice of detection 
signal. For instance, reporter constructs, as described below, can provide a selectable or 
screenable trait upon transcriptional activation (or inactivation) in response to a signal 
transduction pathway coupled to the target receptor. The reporter gene may be an 
unmodified gene already in the host cell pathway, such as the genes responsible for growth 
arrest in yeast. It may be a host cell gene that has been operably linked to a 
"receptor-responsive" promoter. Alternatively, it may be a heterologous gene that has been so 
linked. Suitable genes and promoters are discussed below. In other embodiments, second 
messenger generation can be measured directly in the detection step, such as mobilization of 
intracellular calcium or phospholipid metabolism are quantitated. Accordingly, it will be 
understood that to achieve selection or screening, the host cell must have an appropriate 
phenotype. For example, introducing a pheromone-responsive chimeric HIS 3 gene into a 
yeast that has a wild-type HIS3 gene would frustrate genetic selection. Thus, to achieve 
nutritional selection, an auxotrophic strain is wanted. 

To further illustrate, in a preferred embodiment of the subject assay using a yeast host 
cell, the yeast cells possess one or more of the following characteristics: (a) the endogenous 
FUS1 gene has been inactivated; (b) the endogenous SST2 gene, and/or other genes involve 
in desensitization, has been inactivated; (c) if there is a homologous, endogenous receptor 
gene iUhas been inactivated; and (d) if the yeast produces an endogenous ligand to the 
exogenous receptor, the genes encoding for the ligand been inactivated. 

Other complementations for use in the subject assay can be constructed without any 
undue experimentation. Indeed, many yeast genetic complementation with mammalian 


signal transduction proteins have been described in the art. For example, Mosteller et al. 
(1994) Mol Cell Biol 14:1 104-12 demonstrates that human Ras proteins can complement loss 
of ras mutations in S. cerevisiae. Moreover, Toda et al. (1986) Princess Takamatsu Symp 17: 
253-60 have shown that human ras proteins can complement the loss of RAS1 and RAS2 
proteins in yeast, and hence are functionally homologous. Both human and yeast RAS 
proteins can stimulate the magnesium and guanine nucleotide-dependent adenylate cyclase 
activity present in yeast membranes. Ballester et al. (1989) Cell 59: 681-6 describe a vector 
to express the mammalian GAP protein in the yeast S. cerevisiae. When expressed in yeast, 
GAP inhibits the function of the human ras protein, and complements the loss of IRA 1 . IRA1 
is a yeast gene that encodes a protein with homology to GAP and acts upstream of RAS. 
Mammalian GAP can therefore function in yeast and interact with yeast RAS. Wei et al. 
(1994) Gene 151: 279-84 describes that a human Ras-specific guanine nucleotide-exchange 
factor, Cdc25GEF, can complement the loss of CDC25 function in S. cerevisiae. Martegani 
et al. (1992) EMBO J 11: 2151-7 describe the cloning by functional complementation of a 
mouse cDNA encoding a homolog of CDC25, a Saccharomyces cerevisiae RAS activator. 
Vojtek et al. (1993) J Cell Sci 105: 777-85 and Matviw et al. (1992) Mol Cell Biol 12: 5033- 
40 describe how a mouse CAP protein, e.g., an adenylyl cyclase associated protein associated 
with ras-mediated signal transduction, can complements defects in S. cerevisiae. Papasawas 
et al. (1992) Biochem Biophys Res Commun 184:1378-85 also suggest that inactivated yeast 
adenyl cyclase can be complemented by a mammalian adenyl cyclase gene. Hughes et al. 
(1993) Nature 364: 349-52 describe the complementation of byrl in fission yeast by 
mammalian MAP kinase kinase (MEK). Parissenti et al. (1993) Mol Cell Endocrinol 98: 9-16 
describes the reconstitution of bovine protein kinase C (PKC) in yeast. The Ca(2+)- and 
phospholipid-dependent Ser/Thr kinase PKC plays important roles in the transduction of 
cellular signals in mammalian cells. Marcus et al. (1995) PNAS 92: 6180-4 suggests the 
complementation of shkl null mutations in S. pombe by the either the structurally related S. 
cerevisiae Ste20 or mammalian p65PAK protein kinases. 

"Inactivation", with respect to genes of the host cell, means that production of a 
functional gene product is prevented or inhibited. Inactivation may be achieved by deletion of 
the gene, mutation of the promoter so that expression does not occur, or mutation of the 
coding sequence so that the gene product is inactive. Inactivation may be partial or total. 

"Complementation", with respect to genes of the host cell, means that at least partial 
function of inactivated gene of the host cell is supplied by an exogenous nucleic acid. For 
instance, yeast cells can be "mammalianized", and even "humanized", by complementation of 
receptor 1 and signal transduction proteins with mammalian homologs. To illustrate, 
inactivation of a yeast Byr2/Stell gene can be complemented by expression of a human 
MEKK gene. 


///. Expression Systems 

Ligating a polynucleotide coding sequence into a gene construct, such as an 
expression vector, and ti^sforming or transfecting into hosts, either eukaryotic (yeast, 
avian, insect or mammalian) or prokaryotic (bacterial cells), are standard procedures used in 
producing other well-known proteins, including sequences encoding exogenous receptor and 
peptide libraries. Similar procedures, or modifications thereof, can be employed to prepare 
recombinant reagent cells of the present invention by tissue-culture technology in accord with 
the subject invention. 

In general, it will be desirable that the vector be capable of replication in the host cell. 
It may be a DNA which is integrated into the host genome, and thereafter is replicated as a 
part of the chromosomal DNA, or it may be DNA which replicates autonomously, as in the 
case of a plasmid. In the latter case, the vector will include an origin of replication which is 
functional in the host. In the case of an integrating vector, the vector may include sequences 
which facilitate integration, e.g., sequences homologous to host sequences, or encoding 
integrases. 

Appropriate cloning and expression vectors for use with bacterial, fungal, yeast, and 
mammalian cellular hosts are known in the art, and are described in, for example, Powels et 
al. {Cloning Vectors: A Laboratory Manual, Elsevier, New York, 1985). Mammalian 
expression vectors may comprise non-transcribed elements such as an origin of replication, a 
suitable promoter and enhancer linked to the gene to be expressed, and other 5' or 3' flanking 
nontranscribed sequences, and 5' or 3* nontranslated sequences, such as necessary ribosome 
binding sites, a poly-adenylation site, splice donor and acceptor sites, and transcriptional 
termination sequences. 

The preferred mammalian expression vectors contain both prokaryotic sequences, to 
facilitate the propagation of the vector in bacteria, and one or more eukaryotic transcription 
units that are expressed in eukaryotic cells. The pcDNAI/amp, pcDNAI/neo, pRc/CMV, 
pSV2gpt, pSV2neo, pSV2-dhfr, pTk2, pRSVneo, pMSG, pSVT7, pko-neo and pHyg derived 
vectors are examples of mammalian expression vectors suitable for transfection of eukaryotic 
cells. Some of these vectors are modified with sequences from bacterial plasmids, such as 
pBR322, to facilitate replication and drug resistance selection in both prokaryotic and 
eukaryotic cells. Alternatively, derivatives of viruses such as the bovine papillomavirus 
(BPV-1), or Epstein-Barr virus (pHEBo, pREP-derived and p205) can be used for transient 
expression of proteins in eukaryotic cells. The various methods employed in the preparation 
of the plasmids and transformation of host organisms are well known in the art. For other 
suitable expression systems for both prokaryotic and eukaryotic cells, as well as general 
recombinant procedures, see Molecular Cloning A Laboratory Manual, 2nd Ed., ed. by 


Sambrook, Fritsch and Maniatis (Cold Spring Harbor Laboratory Press: 1989) Chapters 16 
and 17. 

The transcriptional and translational control sequences in expression vectors to be 
used in transforming mammalian cells may be provided by viral sources. For example, 
commonly used promoters and enhancers are derived from Polyoma, Adenovirus 2, Simian 
Virus 40 (SV40), and human cytomegalovirus. DNA sequences derived from the SV40 viral 
genome, for example, SV40 origin, early and late promoter, enhancer, splice, and 
polyadenylation sites may be used to provide the other genetic elements required for 
expression of a heterologous DNA sequence. The early and late promoters are particularly 
useful because both are obtained easily from the virus as a fragment which also contains the 
SV40 viral origin of replication (Fiers et al.(1978) Nature 273:1 1 1) Smaller or larger SV40 
fragments may also be used, provided the approximately 250 bp sequence extending from the 
Hind III site toward the Bgl I site located in the viral origin of replication is included. 
Exemplary vectors can be constructed as disclosed by Okayama and Berg (1983, Mol. Cell 
Biol. 3:280). A useful system for stable high level expression of mammalian receptor cDNAs 
in CI 27 murine mammary epithelial cells can be constructed substantially as described by 
Cosman et al (1986, Mol. Immunol. 23:935). Other expression vectors for use in mammalian 
host cells are derived from retroviruses. 

In other embodiments, the use of viral transfection can provide stably integrated 
copies of the expression construct. In particular, the use of retroviral, adenoviral or adeno- 
associated viral vectors is contemplated as a means for providing a stably transfected cell line 
which expresses an exogenous receptor, and/or a polypeptide library. 

A number of vectors exist for the expression of recombinant proteins in yeast. For 
instance, YEP24, YIP5, YEP51, YEP52, pYES2, and YRP17 are cloning and expression 
vehicles useful in the introduction of genetic constructs into S. cerevisiae (see, for example, 
Broach et al. (1983) in Experimental Manipulation of Gene Expression, ed. M. Inouye 
Academic Press, p. 83, incorporated by reference herein). These vectors can replicate in E. 
coli due the presence of the pBR322 ori, and in S. cerevisiae due to the replication 
determinant of the yeast 2 micron plasmid. In addition, drug resistance markers such as 
ampicillin can be used. Moreover, if yeast are used as a host cell, it will be understood that 
the expression of a gene in a yeast cell requires a promoter which is functional in yeast. 
Suitable promoters include the promoters for metallothionein, 3-phosphoglycerate kinase 
(Hitzeman et al., J. Biol. Chem. 255, 2073 (1980) or other glycolytic enzymes (Hess et al., J. 
Adv. Enzyme Req. 7, 149 (1968); and Holland et al. Biochemistry 17, 4900 (1978)), such as 
enolase, glyceraldehyde-3 -phosphate dehydrogenase, hexokinase, pyruvate decarboxylase, 
phospho-fructokinase, glucose-6-phosphate isomerase, 3-phosphoglycerate mutase, pyruvate 
kinase, triosephosphate isomerase, phospho-glucose isomerase, and glucokinase. Suitable 


vectors and promoters for use in yeast expression are further described in R. Hitzeman et al., 
EPO Publn. No. 73,657. Other promoters, which have the additional advantage of 
transcription controlled by growth conditions, are the promoter regions for alcohol 
dehydrogenase 2, isocytochrome C, acid phosphatase, degradative enzymes associated with 
nitrogen metabolism, and the aforementioned metallothionein and 
glyceraldehyde-3 -phosphate dehydrogenase, as well as enzymes responsible for maltose and 
galactose utilization. Finally, promoters that are active in only one of the two haploid mating 
types may be appropriate in certain circumstances. Among these haploid-specific promoters, 
the pheromone promoters MFal and MFal are of particular interest. 

In some instances, it may be desirable to derive the host cell using insect cells. In 
such embodiments, recombinant polypeptides can be expressed by the use of a baculovirus 
expression system. Examples of such baculovirus expression systems include pVL-derived 
vectors (such as pVL1392, pVL1393 and pVL941), pAcUW-derived vectors (such as 
pAcUWl), and pBlueBac-derived vectors (such as the fi-gal containing pBlueBac III). 

Libraries of random peptides or cDNA fragments may be expressed in a multiplicity 
of ways, including as portions of chimeric proteins. As described below, where secretion of 
the peptide library is desired, the peptide library can be engineered for secretion or transport 
to the extracellular space via the yeast pheromone system 

In constructing suitable expression plasmids, the termination sequences associated 
with these genes, or with other genes which are efficiently expressed in yeast, may also be 
ligated into the expression vector 3* of the heterologous coding sequences to provide 
polyadenylation and termination of the mRNA. 

IK Peripfamic Secretion 

If yeast cells are used as the host cell it will be noted that the yeast cell is bounded by 
a lipid bilayer called the plasma membrane. Between this plasma membrane and the cell wall 
is the periplasmic space. Peptides secreted by yeast cells cross the plasma membrane through 
a variety of mechanisms and thereby enter the periplasmic space. The secreted peptides are 
then free to interact with other molecules that are present in the periplasm or displayed on the 
outer surface of the plasma membrane. The peptides then either undergo re-uptake into the 
cell, diffuse through the cell wall into the medium, or become degraded within the 
periplasmic space. 

The test polypeptide library may be secreted into the periplasm by any of a number of 
exemplary mechanisms, depending on the nature of the expression system to which they are 
linked. In one embodiment, the peptide may be structurally linked to a yeast signal sequence, 
such as that present in the a-factor precursor, which directs secretion through the 


endoplasmic reticulum and Golgi apparatus. Since this is the same route that the receptor 
protein follows in its journey to the plasma membrane, opportunity exists in cells expressing 
both the receptor and the peptide library for a specific peptide to interact with the receptor 
during transit through the secretory pathway. This has been postulated to occur in 
mammalian cells exhibiting autocrine activation. Such interaction could yield activation of 
the response pathway during transit, which would still allow identification of those cells 
expressing a peptide agonist. For situations in which peptide antagonists to externally applied 
receptor agonist are sought, this system would still be effective, since both the peptide 
antagonist and receptor would be delivered to the outside of the cell in concert. Thus, those 
cells producing an antagonist would be selectable, since the peptide antagonist would be 
properly and timely situated to prevent the receptor from being stimulated by the externally 
applied agonist. 

An alternative mechanism for delivering peptides to the periplasmic space is to use 
the ATP-dependent transporters of the STE6/MDR1 class. This transport pathway and the 
signals that direct a protein or peptide to this pathway are not as well characterized as is the 
endoplasmic reticulum-based secretory pathway. Nonetheless, these transporters apparently 
can efficiently export certain peptides directly across the plasma membrane, without the 
peptides having to transit the ER/Golgi pathway. It is anticipated that at least a subset of 
peptides can be secreted through this pathway by expressing the library in context of the 
a-factor prosequence and terminal tetrapeptide. The possible advantage of this system is that 
the receptor and peptide do not come into contact until both are delivered to the external 
surface of the cell. Thus, this system strictly mimics the situation of an agonist or antagonist 
that is normally delivered from outside the cell. Use of either of the described pathways is 
within the scope of the invention. 

The present invention does not require periplasmic secretion, or, if such secretion is 
provided, any particular secretion signal or transport pathway. 

V. Cytokine Receptors 

In one embodiment the target receptor is a cytokine receptor. Cytokines are a family 
of soluble mediators of cell-to-cell communication that includes interleukins, interferons, and 
colony-stimulating factors. The characteristic features of cytokines lie in their functional 
redundancy and pleiotropy. Most of the cytokine receptors that constitute distinct 
superfamilies do not possess intrinsic protein tyrosine kinase domains, yet receptor 
stimulation usually invokes rapid tyrosine phosphorylation of intracellular proteins, including 
the receptors themselves. Many members of the cytokine receptor superfamily acitvate the 
Jak protein tyrosine kinase family, with resultant phosphorylation of the STAT 
transcriptional activator factors. IL-2, IL-7, IL-2 and Interferon y have all been shown to 


activate Jak kinases (Frank et al (1995) Proc Natl Acad Sci USA 92:7779-7783); Scharfe et 
al. (1995) Blood 86:2077-2085); (Bacon et al. (1995) Proc Natl Acad Sci USA 92:7307- 
731 1); and (Sakatsume et al (1995) J. Biol Chem 270:17528-17534). Events downstream of 
Jak phosphorylation have also been elucidated. For example, exposure of T lymphocytes to 
IL-2 has been shown to lead to the phosphorylation of signal transducers and activators of 
transcription (STAT) proteins STAT la, STAT2P, and STAT3, as well as of two STAT- 
related proteins, p94 and p95. The STAT proteins were found to translocate to the nucleus 
and to bind to a specific DNA sequence, thus suggesting a mechanism by which IL-2 may 
activate speicfic genes involved in immune cell function (Frank et al. supra). Jak3 is 
associated with the gamma chain of the IL-2, IL-4, and IL-7 cytokine receptors (Fujii et al. 
(1995) Proc Natl Acad Sci 92:5482-5486) and (Musso et al (1995) J Exp Med. 181:1425- 
143 1). The Jak kinases have also been shown to be activated by numerous ligands that signal 
via cytokine receptors such as, growth hormone and erythropoietin and IL-6 (Kishimoto 
(1994) Stem cells Suppl 12:37-44). 

Detection signals which may be scored for in the present assay, in addition to direct 
detection of second messangers, such as by changes in phosphorylation, includes reporter 
constructs which include transcriptional regulatory elements responsive to the STAT 
proteins. Described infra. 

VI Multisubunit Immune Reco gnition Receptor (MIRR). 

In another embodiment the receptor is a multisubunit receptor. Receptors can be 
comprised of multiple proteins referred to as subunits, one category of which is referred to as 
a multisubunit receptor is a multisubunit immune recognition receptor (MIRR). MIRRs 
include receptors having multiple noncovalently associated subunits and are capable of 
interacting with src-family tyrosine kinases. MIRRs can include, but are not limited to, B 
cell antigen receptors, T cell antigen receptors, Fc receptors and CD22. One example of an 
MIRR is an antigen receptor on the surface of a B cell. To further illustrate, the MIRR on the 
surface of a B cell comprises membrane-bound immunoglobulin (mlg) associated with the 
subunits Ig-a and Ig-p or Ig-y, which forms a complex capable of regulating B cell function 
when bound by antigen. An antigen receptor can be functionally linked to an amplifier 
molecule in a manner such that the amplifier molecule is capable of regulating gene 
transcription. 

Src-family tyrosine kinases are enzymes capable of phosphorylating tyrosine residues 
of a tarfeet molecule. Typically, a src-family tyrosine kinase contains one or more binding 
domains and a kinase domain. A binding domain of a src-family tyrosine kinase is capable of 
binding to a target molecule and a kinase domain is capable of phosphorylating a target 
molecule bound to the kinase. Members of the src family of tyrosine kinases are 
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characterized by an N-terminal unique region followed by three regions that contain different 
degrees of homology among all the members of the family. These three regions are referred 
to as src homology region 1 (SHI), src homology region 2 (SH2) and src homology region 3 
(SH3). Both the SH2 and SH3 domains are believed to have protein association functions 
important for the formation of signal transduction complexes. The amino acid sequence of an 
N-terminal unique region, varies between each src-family tyrosine kinase. An N-terminal 
unique region can be at least about the first 40 amino acid residues of the N-terminal of a src- 
family tyrosine kinase. 

Syk-family kinases are enzymes capable of phosphorylating tyrosine residues of a 
target molecule. Typically, a syk-family kinase contains one or more binding domains and a 
kinase domain. A binding domain of a syk-family tyrosine kinase is capable of binding to a 
target molecule and a kinase domain is capable of phosphorylating a target molecule bound to 
the kinase. Members of the syk- family of tyrosine kinases are characterized by two SH2 
domains for protein association function and a tyrosine kinase domain. 

A primary target molecule is capable of further extending a signal transduction 
pathway by modifying a second messenger molecule. Primary target molecules can include, 
but are not limited to, phosphatidylinositol 3 -kinase (PI-3K), P21 ras GAPase-activating 
protein and associated PI 90 and P62 protein, phospholipases such as PLCyl and PLCy2, 
MAP kinase, She and VAV. A primary target molecule is capable of producing second 
messenger molecule which is capable of further amplifying a transduced signal. Second 
messenger molecules include, but are not limited to diacylglycerol and inositol 1,4,5- 
triphosphate (IP3). Second messenger molecules are capable of initiating physiological 
events which can lead to alterations in gene transcription. For example, production of IP3 
can result in release of intracellular calcium, which can then lead to activation of calmodulin 
kinase II, which can then lead to serine phosphorylation of a DNA binding protein referred to 
as ets-1 proto-onco-protein. Diacylglycerol is capable of activating the signal transduction 
protein, protein kinase C which affects the activity of the API DNA binding protein complex. 
Signal transduction pathways can lead to transcriptional activation of genes such as c-fos, 
egr-1, and c-myc. 

She can be thought of as an adaptor molecule. An adaptor molecule comprises a 
protein that enables two other proteins to form a complex (e.g., a three molecule complex). 
She protein enables a complex to form which includes Grb2 and SOS. She comprises an SH2 
domain that is capable of associating with the SH2 domain of Grb2. 

Molecules of a signal transduction pathway can associate with one another using 
recognition sequences. Recognition sequences enable specific binding between two 
molecules. Recognition sequences can vary depending upon the structure of the molecules 
that are associating with one another. A molecule can have one or more recognition 
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sequences, and as such can associate with one or more different molecules. 

Signal transduction pathways for MIRR complexes are capable of regulating the 
biological functions of a cell. Such functions can include, but are not limited to the ability of 
a cell to grow, to differentiate and to secrete cellular products. MIRR-induced signal 
transduction pathways can regulate the biological functions of specific types of cells involved 
in particular responses by an animal, such as immune responses, inflammatory responses and 
allergic responses. Cells involved in an immune response can include, for example, B cells, 
T cells, macrophages, dendritic cells, natural killer cells and plasma cells. Cells involved in 
inflammatory responses can include, for example, basophils, mast cells, eosinophils, 
neutrophils and macrophages. Cells involved in allergic responses can include, for example 
mast cells, basophils, B cells, T cells and macrophages. 

In exemplary embodiments of the subject assay, the detection signal is a second 
messangers, such as a phosphorylated src-like protein, includes reporter constructs which 
include transcriptional regulatory elements such as serum response element (SRE), 12-0- 
tetradecanoyl-phorbol- 13 -acetate response element, cyclic AMP response element, c- fos 
promoter, or a CREB-responsive element. 

VIL Nucleqr Receptors, 

In another embodiment, the target receptor is a nuclear receptor. The nuclear 
receptors may be viewed as ligand-dependent transcription factors. These receptors provide a 
direct link between extracellular signals, mainly hormones, and transcriptional responses. 
Their transcriptional activation ruction is regulated by endogenous small molecules, such as 
steroid hormones, vitamin D, ecdysone, retinoic acids and thyroid hormones, which pass 
readily through the plasma membrane and bind their receptors inside the cell (Laudet and 
Adelmant (1995) Current Biology 5:124). The majority of these receptors appear to contain 
three domains: a variable amino terminal domain; a highly conserved, DNA-binding domain 
and a moderately conserved, carboxyl-terrninal ligand-binding domain (Power et al. (1993) 
Curr. Opin. Cell Biol. 5:499-504). Examples include the estrogen, progesterone, androgen, 
thyroid hormone and mineralocorticoid receptors. In addition to the known steroid 
receptors, at least 40 orphan members of this superfamily have been identified. (Laudet et 
al., (1992) EMBO J. 11:1003-1013). There are at least four groups of orphan nuclear 
receptors represented by NGF1, FTZ-F1, Rev-erbs, and RARs, which are by evolutionary 
standards, only distantly related to each other (Laudet et al. supra). While the steroid 
hormone receptors bind exclusively as homodimers to a palindrome of their hormone 
responsive element other nuclear receptors bind as heterodimers. Interestingly, some orphan 
receptors bind as monomers to similar response elements and require for their function a 
specific motif that is rich in basic amino-acid residues and is located corboxy-terminal to the 
DNA-binding domain (Laudet and Adelmant supra.) 
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In preferred embodiments, the subject assay is derived to utilize a hormone-dependent 
reporter construct for selection. For instance, glucocorticoid response elements (GREs) and 
thyroid receptor enhancer-like DNA sequences (TREs) can be used to drive expression of 
reporter construct in response to hormone binding to hormone receptors. GRE's are enhancer- 
like DNA sequences that confer glucocorticoid responsiveness via interaction with the 
glucocorticoid receptor. See Payvar, et al. (1983) Cell 35:381 and Schiedereit et al. (1983) 
Nature 304:749. TRE's are similar to GRE's except that they confer thyroid hormone 
responsiveness via interaction with thyroid hormone receptor. Turning now to the interaction 
of hormones and receptors, it is known that a steroid or thyroid hormone enters cells by 
facilitated diffusion and binds to its specific receptor protein, initiating an allosteric alteration 
of the protein. As a result of this alteration, the hormone/receptor complex is capable of 
binding to certain specific sites on transcriptional regulatory sequence with high affinity. 

It is also known that many of the primary effects of steroid and thyroid hormones 
involve increased transcription of a subset of genes in specific cell types. Moreover, there is 
evidence that activation of transcription (and, consequently, increased expression) of genes 
which are responsive to steroid and thyroid hormones (through interaction of chromatin with 
hormone receptor/hormone complex) is effected through binding of the complex to enhancers 
associated with the genes. 

In any case, a number of steroid hormone and thyroid hormone responsive 
transcriptional control units, some of which have been shown to include enhancers, have been 
identified. These include the mouse mammary tumor virus 5'-long terminal repeat (MMTV 
LTR), responsive to glucocorticoid, aldosterone and androgen hormones; the transcriptional 
control units for mammalian growth hormone genes, responsive to glucocorticoids, estrogens, 
and thyroid hormones; the transcriptional control units for mammalian prolactin genes and 
progesterone receptor genes, responsive to estrogens; the transcriptional control units for 
avian ovalbumin genes, responsive to progesterones; mammalian metallothionein gene 
transcriptional control units, responsive to glucocorticoids; and mammalian hepatic alpha 2u 
-globulin gene transcriptional control units, responsive to androgens, estrogens, thyroid 
hormones and glucocorticoids. Such steroid hormone and thyroid hormone responsive 
transcriptional control units can be used to generate reporter constructs which are sensitive to 
agonists and antagonists of the steroid hormone and/or thyroid hormone receptors. See, for 
example, U.S. Patents 5,298,429 and 5,071,773, both to Evans, et. al. Moreover, the art 
describes the functional expression of such receptors in yeast. See also for example, Caplan 
et al. (1995) J Biol Chem 270:5251-7; and Baniahmad et al. (1995) Mol Endocrinol 9: 34-43. 


VIII. Receptor tyrosine kinases 

In still another embodiment, the target receptor is a receptor tyrosine kinase. The 
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receptor tyrosine kinases can be divided into five subgroups on the basis of structural 
similarities in their extracellular domains and the organization of the tyrosine kinase catalytic 
region in their cytoplasmic domains. Sub-groups I (epidermal growth factor (EGF) receptor- 
like), II (insulin receptor-like) and the eph/eck family contain cysteine-rich sequences (Hirai 
et al., (1987) Science 238:1717-1720 and Lindberg and Hunter, (1990) Mol Cell Biol 
10:6316-6324). The functional domains of the kinase region of these three classes of 
receptor tyrosine kinases are encoded as a contiguous sequence ( Hanks et al. (1988) Science 
241:42-52). Subgroups III (platelet-derived growth factor (PDGF) receptor-like) and IV (the 
fibro-blast growth factor (FGF) receptors) are characterized as having immunoglobulin (Ig)- 
like folds in their extracellular domains, as well as having their kinase domains divided in 
two parts by a variable stretch of unrelated amino acids (Yanden and Ullrich (1988) supra 
and Hanks et al. (1988) supra). 

The family with by far the largest number of known members is the EPH family. 
Since the description of the prototype, the EPH receptor (Hirai et al. (1987) Science 
238:1717-1720), sequences have been reported for at least ten members of this family, not 
counting apparently orthologous receptors found in more than one species. Additional partial 
sequences, and the rate at which new members are still being reported, suggest the family is 
even larger (Maisonpierre et al. (1993) Oncogene 8:3277-3288; Andres et al. (1994) 
Oncogene 9:1461-1467; Henkemeyer et al. (1994) Oncogene 9:1001-1014; Ruiz et al. (1994) 
Mech Dev 46:87-100; Xu et al. (1994) Development 120:287-299; Zhou et al. (1994) J 
Neurosci Res 37:129-143; and references in Tuzi and Gullick (1994) Br J Cancer 69:417- 
421). Remarkably, despite the large number of members in the EPH family, all of these 
molecules were identified as orphan receptors without known ligands. 

The expression patterns determined for some of the EPH family receptors have 
implied important roles for these molecules in early vertebrate development. In particular, 
the timing and pattern of expression of sek, mek4 and some of the other receptors during the 
phase of gastrulation and early organogenesis has suggested functions for these receptors in 
the important cellular interactions involved in patterning the embryo at this stage (Gilardi- 
Hebenstreit et al. (1992) Oncogene 7:2499-2506; Nieto et al. (1992) Development 116:1137- 
1 150; Henkemeyer et al., supra; Ruiz et al., supra; and Xu et al., supra). Sek, for example, 
shows a notable early expression in the two areas of the mouse embryo that show obvious 
segmentation, namely the somites in the mesoderm and the rhombomeres of the hindbrain; 
hence the name sek, for segmentally expressed kinase (Gilardi-Hebenstreit et al., supra; Nieto 
et al., supra). As in Drosophila, these segmental structures of the mammalian embryo are 
implicated as important elements in establishing the body plan. The observation that Sek 
expression precedes the appearance of morphological segmentation suggests a role for sek in 
forming these segmental structures, or in detennining segment-specific cell properties such as 
lineage compartmentation (Nieto et al., supra). Moreover, EPH receptors have been 
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implicated, by their pattern of expression, in the development and maintenance of nearly 
every tissue in the embryonic and adult body. For instance, EPH receptors have been 
detected throughout the nervous system, the testes, the cartilaginous model of the skeleton, 
tooth primordia, the infundibular component of the pituitary, various epithelia tissues, lung, 
pancreas, liver and kidney tissues. Observations such as this have been indicative of 
important and unique roles for EPH family kinases in development and physiology, but 
further progress in understanding their action has been severely limited by the lack of 
information on their ligands. 

As used herein, the terms "EPH receptor" or "EPH-type receptor" refer to a class of 
receptor tyrosine kinases, comprising at least eleven paralogous genes, though many more 
orthologs exist within this class, e.g. homologs from different species. EPH receptors, in 
general, are a discrete group of receptors related by homology and easily reconizable, e.g., 
they are typically characterized by an extracellular domain containing a characteristic spacing 
of cysteine residues near the N-terminus and two fibronectin type III repeats (Hirai et al. 
(1987) Science 238:1717-1720; Lindberg et al. (1990) Mol Cell Biol 10:6316-6324; Chan et 
al. (1991) Oncogene 6:1057-1061; Maisonpierre et al. (1993) Oncogene 8:3277-3288; 
Andres et al. (1994) Oncogene 9:1461-1467; Henkemeyer et al. (1994) Oncogene 9:1001- 
1014; Ruiz et al. (1994) Mech Dev 46:87-100; Xu et aL (1994) Development 120:287-299; 
Zhou et aL (1994) J Neurosci Res 37:129-143; and references in Tuzi and Gullick (1994) Br 
J Cancer 69:417-421). Exemplary EPH receptors include the eph, elk,.eck, sek, mek4, hek, 
hek2, eek, erk, tyrol, tyro4, tyro5, tyro6, tyrol 1, cek4, cek5, cek6, cek7, cekS, cek9, ceklO, 
bsk, rtkl y rtk2, rtk3, mykl, myk2, ehkl, ehk2,pagliaccio, htk, erk and nuk receptors. The term 
"EPH receptor" refers to the membrane form of the receptor protein, as well as soluble 
extracellular fragments which retain the ability to bind the ligand of the present invention. 

In exemplary embodiments, the detection signal is provided by detecting 
phosphorylation of intracellular proteins, e.g., MEKKs, MEKs, or Map kinases, or by the use 
of reporter constructs which include transcriptional regulatory elements responsive to c-fos 
and/or c-jun. Described infra. 

IX. G Protein-Coupled Receptors. 

One family of signal transduction cascades found in eukaryotic cells utilizes 
heterotrimeric "G proteins." Many different G proteins are known to interact with receptors. 
G protein signaling systems include three components: the receptor itself, a GTP-binding 
protein (G protein), and an intracellular target protein. 

The cell membrane acts as a switchboard. Messages arriving through different 
receptors can produce a single effect if the receptors act on the same type of G protein. On 
the other hand, signals activating a single receptor can produce more than one effect if the 
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receptor acts on different kinds of G proteins, or if the G proteins can act on different 
effectors. 

In their resting state, the G proteins, which consist of alpha (a), beta (p) and gamma 
(y) subunits, are complexed with the nucleotide guanosine diphosphate (GDP) and are in 
contact with receptors. When a hormone or other first messenger binds to receptor, the 
receptor changes conformation and this alters its interaction with the G protein. This spurs 
the a subunit to release GDP, and the more abundant nucleotide guanosine triphosphate 
(GTP), replaces it, activating the G protein. The G protein then dissociates to separate the a 
subunit from the still complexed beta and gamma subunits. Either the Got subunit, or the GPy 
complex, depending on the pathway, interacts with an effector. The effector (which is often 
an enzyme) in turn converts an inactive precursor molecule into an active "second 
messenger," which may diffuse through the cytoplasm, triggering a metabolic cascade. After 
a few seconds, the Ga converts the GTP to GDP, thereby inactivating itself. The inactivated 
Gcc may then reassociate with the Gpy complex. 

Hundreds, if not thousands, of receptors convey messages through heterotrimeric G 
proteins, of which at least 17 distinct forms have been isolated. Although the greatest 
variability has been seen in the a subunit, several different p and y structures have been 
reported. There are, additionally, several different G protein-dependent effectors. 

Most G protein-coupled receptors are comprised of a single protein chain that is 
threaded through the plasma membrane seven times. Such receptors are often referred to as 
seven-transmembrane receptors (STRs). More than a hundred different STRs have been 
found, including many distinct receptors that bind the same ligand, and there are likely many 
more STRs awaiting discovery. 

In addition, STRs have been identified for which the natural ligands are unknown; 
these receptors are termed "orphan" G protein-coupled receptors, as described above. 
Examples include receptors cloned by Neote et al. (1993) Cell 72, 415; Kouba et al. FEBS 
Lett (1993) 321, 173; Birkenbach et al.(1993) J. Virol. 67, 2209. 

The "exogenous receptors" of the present invention may be any G protein-coupled 
receptor which is exogenous to the cell which is to be genetically engineered for the purpose 
of the present invention. This receptor may be a plant or animal cell receptor. Screening for 
binding to plant cell receptors may be useful in the development of, e.g., herbicides. In the 
case of an animal receptor, it may be of invertebrate or vertebrate origin. If an invertebrate 
receptor, an insect receptor is preferred, and would facilitate development of insecticides. The 
receptor may also be a vertebrate, more preferably a mammalian, still more preferably a 
human, receptor. The exogenous receptor is also preferably a seven transmembrane segment 
receptor. 

Known ligands for G protein coupled receptors include: purines and nucleotides, such 


as adenosine, cAMP, ATP, UTP, ADP, melatonin and the like; biogenic amines (and related 
natural ligands), such as 5 -hydroxytryptamine, acetylcholine, dopamine, adrenaline, 
adrenaline, adrenaline., histamine, noradrenaline, noradrenaline, noradrenaline., 
tyramine/octopamine and other related compounds; peptides such as adrenocorticotrophic 
hormone (acth), melanocyte stimulating hormone (msh), melanocortins, neurotensin (nt), 
bombesin and related peptides, endothelins, cholecystokinin, gastrin, neurokinin b (nk3), 
invertebrate tachykinin-like peptides, substance k (nk2), substance p (nkl), neuropeptide y 
(npy), thyrotropin releasing-factor (trf), bradykinin, angiotensin ii, beta-endorphin, c5a 
anaphalatoxin, calcitonin, chemokines (also called intercrines), corticotrophic releasing factor 
(erf), dynorphin, endorphin, fmlp and other formylated peptides, follitropin (fsh), fungal 
mating pheremones, galanin, gastric inhibitory polypeptide receptor (gip), glucagon-like 
peptides (glps), glucagon, gonadotropin releasing hormone (gnrh), growth hormone releasing 
hormone(ghrh), insect diuretic hormone, interleukin-8, leutropin (lh/hcg), met-enkephalin, 
opioid peptides, oxytocin, parathyroid hormone (pth) and pthrp, pituitary adenylyl cyclase 
activiating peptide (pacap), secretin, somatostatin, thrombin, thyrotropin (tsh), vasoactive 
intestinal peptide (vip), vasopressin, vasotocin; eicosanoids such as ip-prostacyclin, pg- 
prostaglandins, tx-thromboxanes; retinal based compounds such as vertebrate 1 1-cis retinal, 
invertebrate 1 1 -cis retinal and other related compounds; lipids and lipid-based compounds 
such as cannabinoids, anandamide, lysophosphatidic acid, platelet activating factor, 
leukotrienes and the like; excitatory amino acids and ions such as calcium ions and 
glutamate. 

Suitable examples of G-protein coupled receptors include, but are not limited to, 
dopaminergic, muscarinic cholinergic, a-adrenergic, b-adrenergic, opioid (including delta and 
mu), cannabinoid, serotoninergic, and GABAergic receptors. Preferred receptors include the 
5HT family of receptors, dopamine receptors,C5a receptor and FPRL-1 receptor, cyclo- 
histidyl-proline-diketoplperazine receptors, melanocyte stimulating hormone release 
inhibiting factor receptor, and receptors for neurotensin, thyrotropin releasing hormone, 
calcitonin, cholecytokinin-A, neurokinin-2, histamine-3, cannabinoid, melanocortin, or 
adrenomodulin, neuropeptide- Yl or galanin. Other suitable receptors are listed in the art. The 
term "receptor," as used herein, encompasses both naturally occurring and mutant receptors. 

Many of these G protein-coupled receptors, like the yeast a- and a-factor receptors, 
contain seven hydrophobic amino acid-rich regions which are assumed to lie within the 
plasma membrane. Specific human G protein-coupled STRs for which genes have been 
isolated and for which expression vectors could be constructed include those listed herein and 
others known in the art. Thus, the gene would be operably linked to a promoter functional in 
the cell to be engineered and to a signal sequence that also functions in the cell. For example 
in the case of yeast, suitable promoters include Ste2 . Ste3 and gal 10 . Suitable signal 
sequences include those of Ste2 . Ste3 and of other genes which encode proteins secreted by 


yeast cells. Preferably, when a yeast cell is used, the codons of the gene would be optimized 
for expression in yeast. See Hoekema et al.,(1987) Mol Cell Biol, 7:2914-24; Sharp, et al., 
(1986)14:5125-43. 

The homology of STRs is discussed in Dohlman et al., Ann. Rev. Biochem., (1991) 
60:653-88. When STRs are compared, a distinct spatial pattern of homology is discernible. 
The transmembrane domains are often the most similar, whereas the N- and C-terminal 
regions, and the cytoplasmic loop connecting transmembrane segments V and VI are more 
divergent. 

The functional significance of different STR regions has been studied by introducing 
point mutations (both substitutions and deletions) and by constructing chimeras of different 
but related STRs. Synthetic peptides corresponding to individual segments have also been 
tested for activity. Affinity labeling has been used to identify ligand binding sites. 

It is conceivable that a foreign receptor which is expressed in yeast will functionally 
integrate into the yeast membrane, and there interact with the endogenous yeast G protein. 
More likely, either the receptor will need to be modified (e.g., by replacing its V-VI loop 
with that of the yeast STE2 or STE3 receptor), or a compatible G protein should be provided. 

If the wild-type exogenous G protein-coupled receptor cannot be made functional in 
yeast, it may be mutated for this purpose. A comparison would be made of the amino acid 
sequences of the exogenous receptor and of the yeast receptors, and regions of high and low 
homology identified. Trial mutations would then be made to distinguish regions involved in 
ligand or G protein binding, from those necessary for functional integration in the membrane. 
The exogenous receptor would then be mutated in the latter region to more closely resemble 
the yeast receptor, until functional integration was achieved. If this were insufficient to 
achieve functionality, mutations would next be made in the regions involved in G protein 
binding. Mutations would be made in regions involved in ligand binding only as a last resort, 
and then an effort would be made to preserve ligand binding by making conservative 
substitutions whenever possible. 

Preferably, the yeast genome is modified so that it is unable to produce the yeast 
receptors which are homologous to the exogenous receptors in functional form. Otherwise, a 
positive assay score might reflect the ability of a peptide to activate the endogenous G 
protein-coupled receptor, and not the receptor of interest. 

A. Chemoattractant receptors 

The N-formyl peptide receptor is a classic example of a calcium mobilizing G 
protein-coupled receptor expressed by neutrophils and other phagocytic cells of the 
mammalian immune system (Snyderman et al. (1988) In Inflammation: Basic Principles and 


Clinical Correlates, pp. 309-323). N-formyl peptides of bacterial origin bind to the receptor 
and engage a complex activation program that results in directed cell movement, release of 
inflammatory granule contents, and activation of a latent NADPH oxidase which is important 
for the production of metabolites of molecular oxygen. This pathway initiated by receptor- 
ligand interaction is critical in host protection from pyogenic infections. Similar signal 
transduction occurs in response to the inflammatory peptides C5a and IL-8, 

Two other formyl peptide receptor like (FPRL) genes have been cloned based on their 
ability to hybridize to a fragment of the NFPR cDNA coding sequence. These have been 
named FPRL1 (Murphy et al. (1992) J. Biol Chem. 267:7 '637-7 '6 43) and FPRL2 (Ye et al. 
(1992) Biochem Biophys Res. Comm. 184:582-589). FPRL2 was found to mediate calcium 
mobilization in mouse fibroblasts transfected with the gene and exposed to formyl peptide. 
In contrast, although FPRL1 was found to be 69% identical in amino acid sequence to NFPR, 
it did not bind prototype N-formyl peptides ligands when expressed in heterologous cell 
types. This lead to the hypothesis of the existence of an as yet unidentified ligand for the 
FPRL1 orphan receptor (Murphy et al. supra). 

Using the technology described herein a ligand has been cloned for these orphan 
receptors. 

B. G proteins 

In the case of an exogenous G-protein coupled receptor, the yeast cell must be able to 
produce a G protein which is activated by the exogenous receptor, and which can in turn 
activate the yeast effector(s). The art suggests that the endogenous yeast Get subunit (e.g., 
GPA) will be often be sufficiently homologous to the "cognate" Get subunit which is natively 
associated with the exogenous receptor for coupling to occur. More likely, it will be 
necessary to genetically engineer the yeast cell to produce a foreign Get subunit which can 
properly interact with the exogenous receptor. For example, the Got subunit of the yeast G 
protein may be replaced by the Get subunit natively associated with the exogenous receptor. 

Dietzel and Kurjan, (1987) Cell 50:1001) demonstrated that rat Gas functionally 
coupled to the yeast G(5y complex. However, rat Gcti2 complemented only when 
substantially overexpressed, while GotO did not complement at all. Kang, et al., Mol Cell 
Biol, (1990)10:2582). Consequently, with some foreign Got subunits, it is not feasible to 
simply replace the yeast Get. 

If the exogenous G protein coupled receptor is not adequately coupled to yeast Gpy 
by the Got subunit natively associated with the receptor, the Got subunit may be modified to 
improve coupling. These modifications often will take the form of mutations which increase 
the resemblance of the Got subunit to the yeast Got while decreasing its resemblance to the 


receptor-associated Get. For example, a residue may be changed so as to become identical to 
the corresponding yeast Get residue, or to at least belong to the same exchange group of that 
residue. After modification, the modified Got subunit might or might not be "substantially 
homologous" to the foreign and/or the yeast Got subunit. 

The modifications are preferably concentrated in regions of the Ga which are likely 
to be involved in GPy binding. In some embodiments, the modifications will take the form of 
replacing one or more segments of the receptor-associated Ga with the corresponding yeast 
Ga segment(s), thereby forming a chimeric Ga subunit. (For the purpose of the appended 
claims, the term "segment" refers to three or more consecutive amino acids.) In other 
embodiments, point mutations may be sufficient. 

This chimeric Ga subunit will interact with the exogenous receptor and the yeast Gpy 
complex, thereby permitting signal transduction. While use of the endogenous yeast Gpy is 
preferred, if a foreign or chimeric GPy is capable of transducing the signal to the yeast 
effector, it may be used instead. 

C. Gcl Structure 

Some aspects of Ga structure are relevant to the design of modified Ga subunits. 
The amino terminal 66 residues of GPA1 are aligned with the cognate domains of human 
Gas, Gai2, Gai3, Gal6 and transducin. In the GPA41Ga hybrids, the amino terminal 41 
residues (derived from GPA1) are identical, end with the sequence-LEKQRDKNE- and are 
underlined for emphasis. All residues following the glutamate (E) residue at position 41 are 
contributed by the human Ga subunits, including the consensus nucleotide binding motif 
-GxGxxG-. Periods in the sequences indicate gaps that have been introduced to maximize 
alignments in this region. Codon bias is mammalian. For alignments of the entire coding 
regions of GPA1 with Gas, Gai, and GaO, Gaq and Gaz, see Dietzel and Kurjan (1987, 
Cell 50:573) and Lambright, et al. (1994, Nature 369:621-628). Additional sequence 
information is provided by Mattera, et al. (1986, FEBS Lett 206:36-41), Bray, et al. (1986, 
Proc. Natl Acad. Sci USA 83:8893-8897) and Bray, et al. (1987, Proc Natl Acad Sci USA 
84:5115-5119). 

The gene encoding a G protein homolog of S. cerevisiae was cloned independently by 
Dietzel and Kurjan (supra) (SCG1) and by Nakafuku, et al. (1987 Proc Natl Acad Sci 
84:2140-2144) (GPA1). Sequence analysis revealed a high degree of homology between the 
protein encoded by this gene and mammalian Ga. GPA1 encodes a protein of 472 amino 
acids, as compared with approximately 340-350 a.a. for most mammalian Ga subunits in 
four described families, Gas, Gai, Gaq and Gal2/13. Nevertheless, GPA1 shares overall 
sequence and structural homology with all Ga proteins identified to date. The highest overall 
homology in GPA1 is to the Gai family (48% identity, or 65% with conservative 


substitutions) and the lowest is to GQS (33% identity, or 51% with conservativesubstitutions) 
(Nakafuku, et al., supra). 

The regions of high sequence homology among Ga subunits are dispersed throughout 
their primary sequences, with the regions sharing the highest degree of homology mapping to 
sequence that comprises the guanine nucleotide binding/GTPase domain. This domain is 
structurally similar to the ap fold of ras proteins and the protein synthesis elongation factor 
EF-Tu. This highly conserved guanine nucleotide-binding domain consists of a six-stranded 
P sheet surrounded by a set of five ct-helices. It is within these p sheets and a helices that the 
highest degree of conservation is observed among all Ga proteins, including GPA1. The least 
sequence and structural homology is found in the intervening loops between the p sheets and 
a helices that define the core GTPase domain. There are a total of four "intervening loops" or 
"inserts" present in all Ga subunits. In the crystal structures reported to date for the GDP- 
and GTPyS-liganded forms of bovine rod transducin (Noel, et al. (1993) Nature 366:654- 
663); (Lambright, et al. (1994) Nature 369:621-628), the loop residues are found to be 
outside the core GTPase structure. Functional roles for these loop structures have been 
established in only a few instances. A direct role in coupling to phosphodiesterase-y has been 
demonstrated for residues within inserts 3 and 4 of Gat (Rarick, et al. (1992) Science 
256:1031-1033); (Artemyev, et al. (1992) X Biol Chem. 267:25067-25072), while a 
"GAP-like" activity has been ascribed to the largely a-helical insert 1 domain of GaS 
(Markby, et al. (1993) Science 262:1805-1901). 

While the amino- and carboxy-termini of Ga subunits do not share striking homology 
either at the primary, secondary, or tertiary levels, there are several generalizations that can 
be made about them. First, the amino termini of Ga subunits have been implicated in the 
association of Ga with Gpy complexes and in membrane association via N-terminal 
myristoylation. In addition, the carboxy-termini have been implicated in the association of 
GaPy heterotrimeric complexes with G protein-coupled receptors (Sullivan, et al. (1987) 
Nature 330:758-760); West, et al. (1985) J. Biol Chem. 260:14428-14430); (Conklin, et al. 
(\993)Nature 363:274-276). Data in support of these generalizations about the function of 
the N-terminus derive from several sources, including both biochemical and genetic studies. 

As indicated above, there is little if any sequence homology shared among the amino 
termini of Ga subunits. The amino terminal domains of Ga subunits that precede the first 
p-sheet (containing the sequence motif -LLLLGAGESG-; see Noel, et al. (supra) for the 
numbering of the structural elements of Ga subunits) vary in length from 41 amino acids 
(GPA1) to 31 amino acids (Gat). Most Ga subunits share the consensus sequence for the 
addition of myristic acid at their amino termini (MGxxxS-), although hot all Ga subunits that 
contain this motif have myristic acid covalently associated with the glycine at position 2 
(Speigel, et al. (1991) TIBS 16:338-3441). The role of this post-translational modification has 
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been inferred from studies in which the activity of mutant Got subunits from which the 
consensus sequence for myristoylation has been added or deleted has been assayed (Mumby 
et al. (1990) Proc. Natl Acad. Set USA 87: 728-732; (Under, et al. (1991) J. Biol Chem. 
266:4654-4659); Gallego, et al. (1992) Proc. Natl. Acad. Sci. USA 89:9695-9699). These 
studies suggest two roles for N-terminal myristoylation. First, the presence of animo-terminal 
myristic acid has in some cases been shown to be required for association of Get subunits 
with the membrane, and second, this modification has been demonstrated to play a role in 
modulating the association of Get subunits with GPy complexes. The role of myristoylation 
of the GPA1 gene products, at present, unknown. 

In other biochemical studies aimed at examining the role of the amino-terminus of Ga 
in driving the association between Ga and GPy subunits, proteolytically or genetically 
truncated versions of Ga subunits were assayed for their ability to associate with 
Gpycomplexes, bind guanine nucleotides and/or to activate effector molecules. In all cases, 
Ga subunits with truncated amino termini were deficient in all three functions (Graf, et al. 
(1992) J. Biol Chem. 267:24307-24314); (Journot, et al. (1990) J. Biol Chem. 265:9009- 
9015); and (Neer, et al. (1988) J. Biol Chem 263:8996-9000). Slepak, et al. (1993, J. Biol. 
Chem. 268:1414-1423) reported a mutational analysis of the N-terminal 56 a.a. of 
mammalian Gao expressed in Escherichia coli. Molecules with an apparent reduced ability 
to interact with exogenously added mammalian Gpy were identified in the mutant library. As 
the authors pointed out, however, the assay used to screen the mutants the extent of 
ADP-ribosylation of the mutant Ga by pertussis toxin was not a completely satisfactory 
probe of interactions between Ga and Gpy. Mutations identified as inhibiting the interaction 
of the subunits, using this assay, may still permit the complexing of Ga and Gpy while 
sterically hindering the ribosylation of Ga by toxin. Genetic studies examined the role of 
amino-terminal determinants of Ga in heterotrimer subunit association have been carried out 
in both yeast systems using GPA1 -mammalian Ga hybrids (Kang, et al. (1990) Mol Cell 
Biol. 10:2582-2590) and in mammalian systems using Gai/Gas hybrids (Russell and 
Johnson (1993) Mol Pharmacol. 44:255-263). In the former studies, gene fusions, composed 
of yeast GPA1 and mammalian Ga sequences were constructed by Kang, et al. (supra) and 
assayed for their ability to complement a gpal null phenotype (i.e., constitutive activation of 
the pheromone response pathway) in S. cerevisiae. Kang, et al. demonstrated that wild type 
mammalian Gas, Gai but not Gao proteins are competent to associate with yeast Ga and 
suppress the gpal null phenotype, but only when overexpressed. Fusion proteins containing 
the arnino-terminal 330 residues of GPA1 sequence linked to 160, 143, or 142 residues of the 
mammalian Gas, Gai and Gao carrx>xyl-terminal regions, respectively, also coupled to the 
yeast mating response pathway when overexpressed on high copy plasmids with strong 
inducible (CUP) or constitutive (PGK) promoters. All three of these hybrid molecules were 
able to complement the gpal null mutation in a growth arrest assay, and were additionally 


able to inhibit ctfactor responsiveness and mating in tester strains. These last two 
observations argue that hybrid yeast-mammalian Gcc subunits are capable of interacting 
directly with yeast GPy, thereby disrupting the normal function of the yeast heterotrimer. 
Fusions containing the amino terminal domain of Gas, Gai or Gao, however, did not 
complement the gpal null phenotype, indicating a requirement for determinants in the amino 
terminal 330 amino acid residues of GPA1 for association and sequestration of yeast G(3y 
complexes. Taken together, these data suggest that determinants in the amino terminal region 
of Got subunits determine not only the ability to associate with GPy subunits in general, but 
also with specific GPy subunits in a species-restricted manner. 

Hybrid Gcti/Gots subunits have been assayed in mammalian expression systems 
(Russell and Johnson (supra). In these studies, a large number of chimeric Got subunits were 
assayed for an ability to activate adenylyl cyclase, and therefore, indirectly, for an ability to 
interact with Gpy(i.e., coupling of Get to GPy= inactive cyclase; uncoupling of Ga from GPy 
= active cyclase). From these studies a complex picture emerged in which determinants in the 
region between residues 25 and 96 of the hybrids were found to determine the state of 
activation of these alleles as reflected in their rates of guanine nucleotide exchange and GTP 
hydrolysis and the extent to which they activated adenylyl cyclase in vivo. These data could 
be interpreted to support the hypothesis that structural elements in the region between the 
amino terminal methionine and the ~1 sheet identified in the crystal structure of Gat (see 
Noel, et al. supra and Lambright, et ah supra) are involved in determining the state of 
activity of the heterotrimer by (1) driving association/dissociation between Gaand GPy 
subunits; (2) driving GDP/GTP exchange. While there is no direct evidence provided by 
these studies to support the idea that residues in this region of Ga and residues in GPy 
subunits contact one another, the data nonetheless provide a positive indication for the 
construction of hybrid Ga subunits that retain function. There is, however, a negative 
indicator that derives from this work in that some hybrid constructs resulted in constitutive 
activation of the chimeric proteins (i.e., a loss of receptor-dependent stimulation of GPy 
dissociation and effector activation). 

D. Construction of chimeric Get subunits. 

In designing Ga subunits capable of transmitting, in yeast, signals originating at 
mammalian G protein-coupled receptors, two general desiderata were recognized. First, the 
subunits should retain as much of the sequence of the native mammalian proteins as possible. 
Second, {he level of expression for the heterologous components should approach, as closely 
as possible, the level of their endogenous counterparts. The results described by King, et al. 
(1990, Science 250:121-123) for expression of the human p2-adrenergic receptor and Gas in 
yeast, taken together with negative results obtained by Kang, et al. (supra) with full-length 


mammalian Get subunits other than Gas, led us to the following preferences for the 
development of yeast strains in which mammalian G protein-coupled receptors could be 
linked to the pheromone response pathway. 

1. Mammalian Got subunits will be expressed using the native sequence of each 
subunit or, alternatively, as minimal gene fusions with sequences from the amino- terminus 
of GPA1 replacing the homologous residues from the mammalian Get subunits. 

2. Mammalian Got subunits will be expressed from the GPA1 promotor either on low 
copy plasmids or after integration into the yeast genome as a single copy gene. 

3. Endogenous GPy subunits will be provided by the yeast STE4 and STE18 loci. 

E. Site-Directed Mutagenesis versus Random Mutagenesis 

There are two general approaches to solving stmcture-function problems of the sort 
presented by attempts to define the determinants involved in mediating the association of the 
subunits that comprise the G protein heterotrimer. The first approach, discussed above with 
respect to hybrid constructs, is a rational one in which specific mutations or alterations are 
introduced into a molecule based upon the available experimental evidence. In a second 
approach, random mutagenesis techniques, coupled with selection or screening systems, are 
used to introduce large numbers of mutations into a molecule, and that collection of 
randomly mutated molecules is then subjected to a selection for the desired phenotype or a 
screen in which the desired phenotype can be observed against a background of undesirable 
phenotypes. With random mutagenesis one can mutagenize an entire molecule or one can 
proceed by cassette mutagenesis. In the former instance, the entire coding region of a 
molecule is mutagenized by one of several methods (chemical, PCR, doped oligonucleotide 
synthesis) and that collection of randomly mutated molecules is subjected to selection or 
screening procedures. Random mutagenesis can be applied in this way in cases where the 
molecule being studied is relatively small and there are powerful and stringent selections or 
screens available to discrirninate between the different classes of mutant phenotypes that will 
inevitably arise. In the second approach, discrete regions of a protein, corresponding either to 
defined structural (i.e. ct-helices, p -sheets, turns, surface loops) or functional determinants 
(e.g., catalytic clefts, binding detenmnants, transmembrane segments) are subjected to 
saturating or semi-random mutagenesis and these mutagenized cassettes are re-introduced 
into the context of the otherwise wild type allele. Cassette mutagenesis is most useful when 
there is experimental evidence available to suggest a particular function for a region of a 
molecule and there is a powerful selection and/or screening approach available to 
discriminate between interesting and uninteresting mutants. Cassette mutagenesis is also 
useful when the parent molecule is comparatively large and the desire is to map the 
functional domains of a molecule by mutagenizing the molecule in a step-wise fashion, i.e. 



mutating one linear cassette of residues at a time and then assaying for function. 

The present invention contemplates applying random mutagenesis in order to further 
delineate the determinants involved in Got-GPy association. Random mutagenesis may be 
accomplished by many means, including: 

1. PCR mutagenesis, in which the error prone Taq polymerase is exploited to generate 
mutant alleles of Gcc subunits, which are assayed directly in yeast for an ability to couple to 
yeast Gpy. 

2. Chemical mutagenesis, in which expression cassettes encoding Get subunits are 
exposed to mutagens and the protein products of the mutant sequences are assayed directly in 
yeast for an ability to couple to yeast G py. 

3. Doped synthesis of oligonucleotides encoding portions of the Ga gene. 

4. In vivo mutagenesis, in which random mutations are introduced into the coding 
region of Ga subunits by passage through a mutator strain of E. coli, XL 1 -Red (mutD5 mutS 
mutT) (Stratagene, Menasa, WI). 

The random mutagenesis may be focused on regions suspected to be involved in 
Ga-GPy association as discussed in the next section. Random mutagenesis approaches are 
feasible for two reasons. First, in yeast one has the ability to construct stringent screens and 
facile selections (growth vs. death, transcription vs. lack of transcription) that are not readily 
available in mammalian systems. Second, when using yeast it is possible to screen efficiently 
through thousands of transformants rapidly. Cassette mutagenesis is immediately suggested 
by the observation (see infra) that the GPA41 hybrids couple to the pheromone response 
pathway. This relatively small region of Ga subunits represents a reasonable target for this 
type of mutagenesis. Another region that may be amenable to cassette mutagenesis is that 
defining the surface of the switch region of Ga subunits that is solvent-exposed in the crystal 
structures of Gai and transducin. From the data described below, this surface may contain 
residues that are in direct contact with yeast GPy subunits, and may therefore be a reasonable 
target for mutagenesis. 


F. Rational Design of Chimeric Ga Subunits 

Several classes of rationally designed GPAl-mammalian Ga hybrid subunits have 
been tested for the ability to couple to yeast Py. The first, and largest, class of hybrids are 
those that encode different lengths of the GPA1 amino terminal domain in place of the 
homologous regions of the mammalian Ga subunits. This class of hybrid molecules includes 
GPA BAMH1 > GPA41, GPAjd, and GPAlv/ hybrids, described below. The rationale for 
constructing these hybrid Ga proteins is based on results, described above, that bear on the 
importance of the amino terminal residues of Ga in mediating interaction with GPy. 
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Preferably, the yeast Ga subunit is replaced by a chimsrk Ga subunit in which a 
portion, e.g., at least about 20, more preferably at least about 40, amino acids, which is 
substantially homologous with the corresponding residues of the amino terminus of the yeast 
Ga, is fused to a sequence substantially homologous with the main body of a mammalian (or 
other exogenous) Ga. While 40 amino acids is the suggested starting point, shorter or longer 
portions may be tested to determine the minimum length required for coupling to yeast GBy 
and the maximum length compatible with retention of coupling to the exogenous receptor It 
is presently believed that only the final 10 or 20 amino acids at the carboxy terminus of the 
Ga subunit are required for interaction with the receptor. 

GEAEAMHl-hjirids. Kang et al. supra, described hybrid G a subunits encoding the 
ammo terminal 310 residues of GPA1 fused to the carboxyl terminal 160 143 and 142 
residues, respectively, of GaS, Gai2, and Gao. In all cases examined by Kang et al the 
hybrid proteins were able to complement the growth arrest phenotype of gpal strains' We 
have confirmed these findings and, in addition, have constructed and tested hybrids between 
GPA1 and Gai3, Gaq and Gal6. AH hybrids of this type that have been tested functionally 
complement the growth arrest phenotype of gpal strains. 

GPA41 hybrids. The rationale for constructing a minimal hybrid encoding only 41 
ammo ac.ds of GPA1 relies upon the biochemical evidence for the role of the amino-terminus 
of Ga subunits discussed above, together with the following observation. G p and Gy 
subunits are known to interact via «- helical domains at their respective amino-termini 
(Pronin, et al. (1992) Proc. Natl. Acad. Sci. USA 89:6220-6224); Garritsen, et al 1993) The 
suggestion that the amino termini of Ga subunits may form an helical coil and that this 
helical coil may be involved in association of Ga with Gpy (Masters et al (1986) Protein 
Engineering 1:47-54); Lupas et al.(1992) FEBS Lett. 314:105-108) leads to the hypothesis 
that the three subunits of the G-protein heterotrimer interact with one another reversibly 
through the winding and unwinding of their amino-terminal helical regions. A mechanism of 
this type has been suggested, as well, from an analysis of leucine zipper mutants of the GCN4 
transcription factor (Harbury, et al. (1993) Science 262:1401-1407). The rationale for 
constructing hybrids like those described by Kang, et al. supra., that contain a majority of 
yeast sequence and only minimal mammalian sequence, derives from their ability to function 
m assays of coupling between Ga and G P y subunits. However, these chimeras had never 
been assayed for an ability to couple to both mammalian G protein-coupled receptors and 
yeast GPy subunits, and hence to reconstitute a hybrid signaling pathway in yeast. 

GPA4! hybrids that have been constructed and tested include Gas, Gai2 Gai3 Gaq 
Gao a , Gciob and Gal6. Hybrids of Gas, Gai2, Gai3, and Gal6 functionally' complement 
the growth arrest phenotype of gpal strains, while GPA41 hybrids of Gao a and Gao h do 
not. In addttion to being tested in a growth arrest assay, these constructs have been assayed in 


the more sensitive transcriptional assay for activation of a fuslp-fflS3 gene. In both of these 
assays, the GPA^-Gcts hybrid couples less well than the GPa4j-i2, -i3, and -16 hybrids, 
while the GPot4i ~°a> 311(1 "°b hyrids do not function in either assay. 

Several predictive algorithms indicate that the amino terminal domain up to the 
highly conserved sequence motif-LLLLGAGESG- (the first L in this motif is residue 43 in 
GPA1) forms a helical structure with amphipathic character. Assuming that a heptahelical 
repeat unit, the following hybrids between GPA1 and GctS can be used to define the number 
of helical repeats in this motif necessary for hybrid function: 

GPAl-7/Gas8-394 

GPAl-14/Gasl5-394 

GPAl-21/Gas22-394 

GPAl-28/Gas29-394 

GPAl-35/Gas36-394 

GPAl-42/Gas43-394 

In these hybrids, the prediction is that the structural repeat unit in the amino terminal 
domain up to the tetra-leucine motif is 7, and that swapping sequences in units of 7 will in 
effect amount to a swap of unit turns of turns of the helical structure that comprises this 
domain. 

A second group of "double crossover"' hybrids of this class are those that are aligned 
on the first putative heptad repeat beginning with residue Gil in GPA1. In these hybrids, 
helical repeats are swapped from GPA1 into a GaS backbone one heptad repeat unit at a time. 

GctSl-10/GPAl 1-17/Gas 18-394 

GaS 1 - 1 7/GPA 1 8-24/GaS25-394 

GaS 1-1 7/GPA25-3 l/GaS32-394 

GaS?-17/GPA32-38/GaS39-394 

The gap that is introduced between residues 9 and 10 in the GaS sequence is to 
preserve the alignment of the -LLLLGAGE-sequence motif. This class of hybrids can be 
complemented by cassette mutagenesis of each heptad repeat followed by screening of these 
collections of "heptad" libraries in standard coupling assays. 

A third class of hybrids based on the prediction that the amino terminus forms a 
helical domain with a heptahelical repeat unit are those that effect the overall hydrophobic or 
hydrophilic character of the opposing sides of the predicted helical structure (See Lupas et al. 
supra). In this model, the a and d positions of the heptad repeat abcdefg are found to be 
conserved hydrophobic residues that define one face of the helix, while the e and g positions 
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define the charged face of the helix. In this class of hybrids, the sequence of the GaS parent 

is maintained except for specific substitutions at one or more of the following critical 

residues to render the different helical faces of GaS more "GPAl-like" 
K8Q 
+1-10 
ElOG 
Q12E 
R13S 
N14D 
E15P 
E15F 
K17L 
E21R 
K28Q 
K32L 
V36R 

This collection of single mutations could be screened for coupling efficiency to yeast 
Gpy and then constructed in combinations (double and greater if necessary). 

A fourth class of hybrid molecules that span this region of GPAl-Gcc hybrids are 
those that have junctions between GPA1 and Gcc subunits introduced by three primer PCR. 
In this approach, the two outside primers are encoded by sequences at the initiator 
methionine of GPA1 on the 5' side and at the tetraleucine motif of GaS (for example) on the 
3' side. A series of junctional primers spanning different junctional points can be mixed with 
the outside primers to make a series of molecules each with different amounts of GPA1 and 
GaS sequences, respectively. 

GPA jp and GPA j_\ y hybrids . The regions of high homology among G|3y subunits 
that have been identified by sequence alignment are interspersed throughout the molecule. 
The Gl region containing the highly conserved -GSGESGDST- motif is followed 
immediately by a region of very low sequence consevation, the "il" or insert 1 region. Both 
sequence and length vary considerably among the il regions of the Ga subunits. By aligning 
the sequences of Ga subunits, the conserved regions bounding the il region were identified 
and two additional classes of GPAl-Ga hybrids were constructed. The GPAjd hybrids 
encode the amino terminal 102 residues of GPA1 (up to the sequence -QARKLGIQ-) fused 
in frame to mammalian Ga subunits, while the GPALW hybrids encode the amino terminal 
244 residues of GPA1 (up to the sequence LIHEDIAKA- in GPA1). The reason for 
constructing the GPAjd and GPAlw^ hybrids was to test the hypothesis that the il region of 
GPA1 is required for mediating the interaction of GPA1 with yeast Gpy subunits, for the 
stable expression of the hybrid molecules, or for function of the hybrid molecules. The 
GPAjd hybrids contain the amino terminal domain of GPA1 fused to the il domain of 
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marnrnalian subunits, and therefore do not contain the GPA1 il region, while the GPA LW 
hybrids contain the amino terminal 244 residues of GPA1 including the entire il region (as 
defined by sequence alignments). Hybrids of both GPA ID and GPA LW classes were 
constructed for GaS, C-«i2, Gai3, Gao a , and Gal6; none of these hybrids complemented 
the gpal growth arrest phenotype. 

Subsequent to the construction and testing of the GPA ID and GPALW classes of 
hybrids, the crystal structures of 0*^^ in both the GDP and GTPyS-liganded form and 
the crystal structure of several Gail variants in the GTPyS-liganded and GDP-AIF4 forms 
were reported (Noel et al. supra; Lambright et al. supra; and Coleman et al.(1994) Science 
265:1405-1412). The crystal structures reveal that the ilregion defined by sequence alignment 
has a conserved structure that is comprised of six alpha helices in a rigid array, and that the 
juncnons chosen for the construction of the GPA ID and GPA LW hybrids were not 
compatible with conservation of the structural features of the il region observed in the 
crystals. The junction chosen for the GPA ID hybrids falls in the center of the long «A helix- 
chimerization of this helix in all likelihood destabilizes it and the protein structure in general' 
The same is true of the junction chosen for the GPA LW hybrids in which the crossover point 
between GPA1 and the mammalian Ga subunit falls at the end of the short aC helix and 
therefore may distort it and destabilize the protein. 

The failure of the GPA ID and GPA LW hybrids is predicted to be due to disruption of 
critical structural elements in the il region as discussed above. Based upon new alignments 
and the data presented in Noel et al (supra), Lambright et al (supra), and Coleman et al 
(supra), this problem can be averted with the ras-like core domain and the il helical domain 
are introduced sajtsidfi of known structural elements like alpha-helices. 

Hybrid A GaS 1 -67/GPA66-299/GaS203-394 

This hybrid contains the entire il insert of GPA1 interposed into the 
GaS sequence. 


Hybrid B GPAl-41/GaS4443-67/GPA66-299/GaS203-394 
This hybrid contains the amino terminal 41 residues of GPA1 in 
place of the 42 amino terminal residues of GaS found in Hybrid A. 

Gas_HyMds. There is evidence that the "switch region" encoded by residues 171-237 of Ga 
transduciri (using the numbering of (Noel et al (supra) also plays a role in G P y coupling 
First, the G226A mutation in GaS prevents the GTP-induced conformational change that 
occurs with exchange of GDP for GTP upon receptor activation by ligand. This residue maps 
to the highly conserved sequence -DVGGQ-, present in all Ga subunits and is involved in 


GTP hydrolysis. In both the Gat and Get il crystal structures, this sequence motif resides in 
the loop that connects the £3 sheet and the al helix in the guanine nucleotide binding core. 
In addition to blocking the conformational change that occurs upon GTP binding, this 
mutation also prevents dissociation of GTP-liganded Gas from GPy. Second, crosslinking 
data reveals that a highly conserved cysteine residue in the ctl helix (C215 in Gao, C210 in 
Gat) can be crosslinked to the carboxy terminal region of Gp subunits. Finally, genetic 
evidence (Whiteway et al. (1993) Mol Cell Biol. 14:3233-3239) identifies an important 
single residue in GPA1 (E307) in the p2 sheet of the core structure that may be in direct 
contact with py. A mutation in the GPA1 protein at this position suppresses the constitutive 
signalling phenotype of a variety of STE4 (Gp) dominant negative mutations that are also 
known to be defective in Ga-Gpy association (as assessed in two-hybrid assay in yeast as 
well as by more conventional genetic tests). 

We have tested the hypothesis that there are switch region determinants involved in 
the association of Ga with GPy by constructing a series of hybrid Ga proteins encoding 
portions of GPA1 and GaS in different combinations. 

Two conclusions may be drawn. First, in the context of the amino terminus of GaS, 
the GPA1 switch region suppresses coupling to yeast Gpy (SGS), while in the context of the 
GPA1 amino terminus the GPA1 switch region stabilizes coupling with GPy (GPpy-SGS). 
This suggests that these two regions of GPA1 collaborate to allow interactions between Ga 
subunits and GPy subunits. This conclusion is somewhat mitigated by the observation that 
the GPA4i-Gas hybrid that does not contain the GPA1 switch region is able to complement 
the growth arrest phenotype of gpal strains. We have not to date noted a quantitative 
difference between the behavior of the GPA^-Gas allele and the GPA-I-SGS allele, but if 
this interaction is somewhat degenerate, then it may be difficult to quantitate this accurately. 
The second conclusion that can be drawn from these results is that there are other 
determinants involved in stabilizing the interaction of Ga with Gpy beyond these two regions 
as none of the GPAl/Gas hybrid proteins couple as efficiently to yeast GPy as does native 
GPAl. 

The role of the surface-exposed residues of this region may be crucial for effective 
coupling to yeast Gpy, and can be incorporated into hybrid molecules as follows below. 

GaS-GPA-Switch GaS l-202/GPA298-350/GaS 253-394 
This hybrid encodes the entire switch region of GPA 1 in the context of GaS. 

GaS-GPA-a2 GQS 1-226/GPA322-332/GQS 238-394 
This hybrid encodes the a 2 helix of GPAl in the context of GaS. 


GPA41-GaS-GPA-a2GPAl-41/GQS43-226/GPA322-332/GQS238-394 

This hybrid encodes the 41 residue amino terminal domain of GPA1 and the a2 helix 
of GPA1 in the context of GaS. 

Finally, the last class of hybrids that will be discussed here are those that alter the 

surface exposed residues of the p2 and p3 sheets of aS so that they resemble those of the 

GPA1 QS helix. These altered a2 helical domains have the following structure. (The 

positions of the altered residues correspond to GaS.) 
L203K 
K211E 
D215G 
K216S 
D229S 

These single mutations can be engineered into a GaS backbone singly and in pairwise 
combinations. In addition, they can be introduced in the context of both the full length GaS 
and the GPA4j-GaS hybrid described previously. All are predicted to improve the coupling 
of Ga subunits to yeast Gpy subunits by virtue of improved electrostatic and hydrophobic 
contacts between this region and the regions of Gp defined by Whiteway and coworkers 
(Whiteway et al (supra) that define site(s) that interact with GPA1), 

In summary, the identification of hybrid Ga subunits that couple to the yeast 
pheromone pathway has led to the following general observations. First, all GPAb^MHI 
hybrids associate with yeast GPy, therefore at a minimum these hybrids contain the 
determinants in GPA1 necessary for coupling to the pheromone response pathway. Second, 
the amino terminal 41 residues of GPA1 contain sufficient determinants to facilitate coupling 
of Ga hybrids to yeast Gpy in some, but not all, instances, and that some Ga subunits contain 
regions outside of the first 41 residues that are sufficiently similar to those in GPA1 to 
facilitate interaction with GPA1 even in the absence of the amino terminal 41 residues of 
GPA1. Third, there are other determinants in the first 310 residues of GPA1 that are involved 
in coupling Ga subunits to yeast GPy subunits. 

The various classes of hybrids noted above are not mutually exclusive. For example, a 
GPA1 containing GPAl-4 \ could also feature the L203K mutation. 

While, for the sake of simplicity, we have described hybrids of yeast GPA1 and a 
mammalian Gas, it will be appreciated that hybrids may be made of other yeast Ga subunits 
and/or other mammalian Ga subunits, notably mammalian Gai subunits. Moreover, while 
the described hybrids are constructed from two parental proteins, hybrids of three or more 
parental proteins are also possible. 

As shown in the Examples, chimeric Ga subunits have been especially useful in 


coupling receptors to Gai species. 


G. Expression of Ga 

Kang et al. supra reported that several classes of native mammalian G~ subunits were 
able to interact functionally with yeast a subunits when expression of Ga was driven from a 
constitutively active, strong promoter (PGK) or from a strong inducible promoter (CUP). 
These authors reported that rat GotS, Goti2 or Goto expressed at high level coupled to yeast 
Py. High level expression of mammalian Ga (i.e. non-stoichiometric with respect to yeast py) 
is noj desirable for uses like those described in this application. Reconstruction of G protein- 
coupled receptor signal transduction in yeast requires the signalling component of the 
heterotrimeric complex (Gpy) to be present stoichiometrically with Ga subunits. An excess 
of Ga subunits (as was required for coupling of mammalian Gai2 and Gao to yeast Gpy in 
Kang et al.) would dampen the signal in systems where Gpy subunits transduce the signal. An 
excess of Ga subunits raises the background level of signaling in the system to unacceptably 
high levels. Preferably, levels of Ga and GPy subunits are balanced. For example, 
heterologous Ga subunits may be expressed from a low copy (CEN ARS) vector containing 
the endogenous yeast GPA1 promoter and the GPA1 3' untranslated region. The minimum 
criterion, applied to a heterologous Gasubunit with respect to its ability to couple 
functionally to the yeast pheromone pathway, is that it complement a gpal genotype when 
expressed from the GPA1 promoter on low copy plasmids or from an integrated, single copy 
gene. In the work described in this application, all heterologous Ga subunits have been 
assayed in two biological systems. In the first assay heterologous Ga subunits are tested for 
an ability to functionally complement the growth arrest phenotype of gpal strains. In the 
second assay the transcription of a fusl-HIS3 reporter gene is used to measure the extent to 
which the pheromone response pathway is activated, and hence the extent to which the 
heterologous Ga subunit sequesters the endogenous yeast Gpy complex. Mammalian Gas, 
Gai2, Gai3, Gaq, Gall, Gal 6, Gao a , Gao D , and Gaz from rat, murine or human origins 
were expressed from a low copy, CEN ARS vector containing the GPA1 promoter. 
Functional complementation of gpal strains was not observed in either assay system with any 
of these full-length Ga constructs with the exception of rat and human GaS. 

H. Chimeric Yeast py subunits 

An alternative to the modification of a mammalian Ga subunit for improved signal 
transduction is the modification of the pertinent sites in the yeast Gp or Gy subunits. The 
principles discussed already with respect to Ga subunits apply, mutatis mutandis , to yeast Gp 
orGy. 


For example, it would not be unreasonable to target the yeast Ste4p Gpsubunit with 
cassette mutagenesis. Specifically, the region of Ste4p that encodes several of the dominant 
negative, signaling-defective mutations would be an excellent target for cassette mutagenesis 
when looking for coupling of yeast Gpy to specific mammalian Got subunits. 

X Peptide Libraries 

While others have engineered yeast cells to facilitate screening of exogenous drugs as 
receptor agonists and antagonists, the cells did not themselves produce both the drugs and the 
receptors. Yeast cells engineered to produce the receptor, but that do not produce the drugs 
themselves, are inefficient. To utilize them one must bring a sufficient concentration of each 
drug into contact with a number of cells in order to detect whether or not the drug has an 
action. Therefore, a microtiter plate well or test tube must be used for each drug. The drug 
must be synthesized in advance and be sufficiently pure to judge its action on the yeast cells. 
When the yeast cell produces the drug, the effective concentration is higher. 

Peptide libraries are systems which simultaneously display, in a form which permits 
interaction with a target, a highly diverse and numerous collection of peptides. These 
peptides may be presented in solution (Houghten (1992) Biotechniques 13:412-421), or on 
beads (Lam (1991) Nature 354:82-84), chips (Fodor (1993) Nature 364:555-556), bacteria 
(Ladner USP 5,223,409), spores (Ladner USP '409), plasmids (Cull et al. (1992) Proc Natl 
Acad Sci USA 89:1865-1869) or on phage (Scott and Smith (1990) Science 249:386-390); 
(Devlin (\99Q)Science 249:404-406); (Cwirla et al. (1990) Proc. Natl. Acad Sci. 87:6378- 
6382); (Felici (1991) J. Mol Biol 222:301-310); (Ladner supra.). Many of these systems are 
limited in terms of the maximum length of the peptide or the composition of the peptide (e.g., 
Cys excluded). Steric factors, such as the proximity of a support, may interfere with binding. 
Usually, the screening is for binding in vitro to an artificially presented target, not for 
activation or inhibition of a cellular signal transduction pathway in a living cell. While a cell 
surface receptor may be used as a target, the screening will not reveal whether the binding of 
the peptide caused an allosteric change in the conformation of the receptor. 

The Ladner et al. patent, USSN 5,096,815, describes a method of identifying novel 
proteins or polypeptides with a desired DNA binding activity. Semi-random ("variegated") 
DNA encoding a large number of different potential binding proteins is introduced, in 
expressible form, into suitable host cells. The target DNA sequence is incorporated into a 
genetically engineered operon such that the binding of the protein or polypeptide will prevent 
expressfon'of a gene product that is deleterious to the gene under selective conditions. Cells 
which survive the selective conditions are thus cells which express a protein which binds the 
target DNA. While it is taught that yeast cells may be used for testing, bacterial cells are 
preferred. The interactions between the protein and the target DNA occur only in the cell 
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(and then only in the nucleus), not in the periplasm or cytoplasm, and the target is a nucleic 
acid, and not a receptor protein. Substitution of random peptide sequences for functional 
domains in cellular proteins permits some determination of the specific sequence 
requirements for the accomplishment of function. Though the details of the recognition 
phenomena which operate in the localization of proteins within cells remain largely 
unknown, the constraints on sequence variation of mitochondrial targeting sequences and 
protein secretion signal sequences have been elucidated using random peptides (Lemire et al., 
J. Biol Chem.(\9S9) 264, 20206 and Kaiser et al. (1987) Science 235:312, respectively). 

The peptide library of the present invention takes the form of a cell culture, in which 
essentially each cell expresses one, and usually only one, peptide of the library. While the 
diversity of the library is maximized if each cell produces a peptide of a different sequence, it 
is usually prudent to construct the library so there is some redundancy. Depending on size, 
the combinatorial peptides of the library can be expressed as is, or can be incorporated into 
larger fusion proteins. The fusion protein can provide, for example, stability against 
degradation or denaturation, as well as a secretion signal if secreted. In an exemplary 
embodiment of a library for intracellular expression, e.g., for use in conjunction with 
intracellular target receptors, the polypeptide library is expressed as thioredoxin fusion 
proteins (see, for example, U.S. Patents 5,270,181 arid 5,292,646; and PCT publication 
W094/ 02502). The combinatorial peptide can be attached one the terminus of the 
thioredoxin protein, or, for short peptide libraries, inserted into the so-called active loop. 

In one embodiment, the peptide library is derived to express a combinatorial library of 
polypeptides which are not based on any known sequence, nor derived from cDNA. That is, 
the sequences of the library are largely random. In preferred embodiments, the combinatorial 
polypeptides are in the range of 3-100 amino acids in length, more preferably at least 5-50, 
and even more preferably at least 10, 13, 15, 20 or 25 amino acid residues in length. 
Preferably, the polypeptides of the library are of uniform length. It will be understood that 
the length of the combinatorial peptide does not reflect any extraneous sequences which may 
be present in order to facilitate expression, e.g., such as signal sequences or invariant portions 
of a fusion protein. 

In another embodiment, the peptide library is derived to express a combinatorial 
library of polypeptides which are based at least in part on a known polypeptide sequence or a 
portion thereof (not a cDNA library). That is, the sequences of the library is semi-random, 
being derived by combinatorial mutagenesis of a known sequence. See, for example, Ladner 
et al. PCT publication WO 90/02909; Garrard et al., PCT publication WO 92/09690; Marks 
et al. (1992) J. Biol. Chem. 267:16007-16010; Griffths et al. (1993) EMBO J 12:725-734; 
Clackson et al. (1991) Nature 352:624-628; and Barbas et al. (1992) PNAS 89:4457-4461. 
Accordingly, polypeptide(s) which are known ligands for a target receptor can be 
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mutagenized by standard techniques to derive a variegated library of polypeptide sequences 
which can further be screened for agonists and/or antagonists. For example, the surrogate 
ligand identified for FPRL-1, e.g., the Ser-Leu-Leu-Trp-Leu-Thr-Cys-Arg-Pro-Trp-Glu- Ala- 
Met peptide, can be mutagenized to generate a library of peptides with some relationship to 
the original tridecapeptide. This library can be expressed in a reagent cell of the present 
invention, and other receptor activators can be isolated from the library. This may permit the 
identification of even more potent FPRL-1 surrogate ligands. 

Alternatively, the library can be expressed under conditions wherein the cells are in 
contact with the original tridecapeptide, e.g., the FPRL-1 receptor is being induced by that 
surrogate ligand. Peptides from an expressed library can be isolated based on their ability to 
potentiate the induction, or to inhibit the induction, caused by the surrogate ligand. The latter 
of course will identify potential antagonists of chemoattractant receptors. In still other 
embodiments, the surrogate ligand can be used to screen exogenous compound libraries 
(peptide and non-peptide) which, by modulating the activity of the identified surrogate, will 
presumably also similarly effect the native ligand's effect on the target receptor. In such 
embodiments, the surrogate ligand can be applied to the cells, though is preferably produced 
by the reagent cell, thereby providing an autocrine cell. 

In still another embodiment, the combinatorial polypeptides are produced from a 
cDNA library. 

In a preferred embodiment of the present invention, the yeast cells collectively 
produce a "peptide library", preferably including at least 10 3 to 10 7 different peptides, so that 
diverse peptides may be simultaneously assayed for the ability to interact with the exogenous 
receptor. In an especially preferred embodiment, at least some peptides of the peptide library 
are secreted into the periplasm, where they may interact with the "extracellular" binding 
site(s) of an exogenous receptor. They thus mimic more closely the clinical interaction of 
drugs with cellular receptors. This embodiment optionally may be further improved (in 
assays not requiring pheromone secretion) by preventing pheromone secretion, and thereby 
avoiding competition between the peptide and the pheromone for signal peptidase and other 
components of the secretion system. 

In the present invention, the peptides of the library are encoded by a mixture of DNA 
molecules of different sequence. Each peptide-encoding DNA molecule is ligated with a 
vector DNA molecule and the resulting recombinant DNA molecule is introduced into a host 
cell. Since it is a matter of chance which peptide encoding DNA molecule is introduced into 
a particular cell, it is not predictable which peptide that cell will produce. However, based on 
a knowledge of the manner in which the mixture was prepared, one may make certain 
statistical predictions about the mixture of peptides in the peptide library. 

It is convenient to speak of the peptides of the library as being composed of constant 


and variable residues. If the nth residue is the same for all peptides of the library, it is said to 
be constant. If the nth residue varies, depending on the peptide in question, the residue is a 
variable one. The peptides of the library will have at least one, and usually more than one, 
variable residue. A variable residue may vary among any of two to all twenty of the 
genetically encoded amino acids; the variable residues of the peptide may vary in the same or 
different manner. Moreover, the frequency of occurrence of the allowed amino acids at a 
particular residue position may be the same or different. The peptide may also have one or 
more constant residues. 

There are two principal ways in which to prepare the required DNA mixture. In one 
method, the DNAs are synthesized a base at a time. When variation is desired, at a base 
position dictated by the Genetic Code, a suitable mixture of nucleotides is reacted with the 
nascent DNA, rather than the pure nucleotide reagent of conventional polynucleotide 
synthesis. 

The second method provides more exact control over the amino acid variation. First, 
trinucleotide reagents are prepared, each trinucleotide being a codon of one (and only one) of 
the amino acids to be featured in the peptide library. When a particular variable residue is to 
be synthesized, a mixture is made of the appropriate trinucleotides and reacted with the 
nascent DNA. Once the necessary "degenerate" DNA is complete, it must be joined with the 
DNA sequences necessary to assure the expression of the peptide, as discussed in more detail 
below, and the complete DNA construct must be introduced into the yeast cell. 

XI. Screening and Selection: Assays of Second Messenger Generation 

When screening for bioactivity of peptides, intracellular second messenger generation 
can be measured directly. A variety of intracellular effectors have been identified as being G- 
protein-regulated, including adenylyl cyclase, cyclic GMP, phosphodiesterases, 
phosphoinositidase C, and phpspholipase A2- In addition, G proteins interact with a range of 
ion channels and are able to inhibit certain voltage-sensitive Ca" 1-1- transients, as well as 
stimulating cardiac K + channels. 

In one embodiment, the GTPase enzymatic activity by G proteins can be measured in 
plasma membrane preparations by detenriining the breakdown of y^2p QTP using techniques 
that are known in the art (For example, see Signal Transduction: A Practical Approach. G. 
Milligan, Ed. Oxford University Press, Oxford England). When receptors that modulate 
cAMP are tested, it will be possible to use standard techniques for cAMP detection, such as 
competitive assays which quantitate [ 3 H]cAMP in the presence of unlabelled cAMP. 

Certain receptors stimulate the activity of phospholipase C which stimulates the 
breakdpwn of phosphatidylinositol 4,5, bisphosphate to 1,4,5-IP3 (which mobilizes 
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intracellular Ca++) and diacylglycerol (DAG) (which activates protein kinase C). Inositol 
lipids can be extracted and analyzed using standard lipid extraction techniques. DAG can 
also be measured using thin-layer chromatography. Water soluble derivatives of all three 
inositol lipids (IP1, IP2, IP3) can also be quantitated using radiolabelling techniques or 
HPLC. 

The mobilization of intracellular calcium or the influx of calcium from outside the 
cell can be measured using standard techniques. The choice of the appropriate calcium 
indicator, fluorescent, bioluminescent, metallochromic, or Ca-M-sensitive microelectrodes 
depends on the cell type and the magnitude and time constant of the event under study (Borle 
(1990) Environ Health Perspect 84:45-56). As an exemplary method of Ca++ detection, 
cells could be loaded with the Ca++sensitive fluorescent dye fura-2 or indo-1, using standard 
methods, and any change in Ca+4- measured using a fluorometer. 

The other product of PIP2 breakdown, DAG can also be produced from phosphatidyl 
choline. The breakdown of this phospholipid in response to receptor-mediated signaling can 
also be measured using a variety of radiolabelling techniques. 

The activation of phospholipase A2 can easily be quantitated using known 
techniques, including, for example, the generation of arachadonate in the cell. 

In the case of certain receptors, it may be desirable to screen for changes in cellular 
phosphorylation. Such assay formats may be useful when the receptor of interest is a 
receptor tyrosine kinase. For example, yeast transformed with the FGF receptor and a ligand 
which binds the FGF receptor could be screened using colony immunoblotting (Lyons and 
Nelson (1984) Proc. Natl. Acad Sci. USA 81:7426-7430) using anti-phosphotyrosine. In 
addition, tests for phosphorylation could be useful when a receptor which may not itself be a 
tyrosine kinase, activates protein kinases that function downstream in the signal transduction 
pathway. Likwise, it is noted that protein phosphorylation also plays a critical role in 
cascades that serve to amplify signals generated at the receptor. Multi-kinase cascades allow 
not only signal amplification but also signal divergence to multiple effectors that are often 
cell-type specific, allowing a growth factor to stimulate mitosis of one cell and differentiation 
of another. 

One such cascade is the MAP kinase pathway that appears to mediate both mitogenic, 
differentiation and stress responses in different cell types. Stimulation of growth factor 
receptors results in Ras activation followed by the sequential activation of c-Raf, MEK, and 
p44 and p42 MAP kinases (ERK1 and ERK2). Activated MAP kinase then phosphorylates 
many key regulatory proteins, including p90RSK and Elk-1 that are phosphorylated when 
MAP kinase translocates to the nucleus. Homologous pathways exist in mammalian and 
yeast cells. For instance, an essential part of the S. cerevisiae pheromone signaling pathway 
is comprised of a protein kinase cascade composed of the products of the STE1 1, STE7, and 


FUS3/KSS1 senes (the latter pair are distinct and functionally redundant). Accordingly, 
phosphorylation and/or activation of members of this kinase cascade can be detected and 
used to quantitate receptor engagement. Phosphotyrosine specific antibodies are available to 
measure increases in tyrosine phosphorylation and phospho-specific antibodies are 
commercially available (New England Biolabs, Beverly, MA). 

Modified methods for detecting receptor-mediated signal transduction exist and one 
of skill in the art will recognize suitable methods that may be used to substitute for the 
example methods listed. 

X1L Screening and Selection Using Reporter Gene Constructs 

In addition to measuring second messenger production, reporter gene constructs can 
be used. Reporter gene constructs are prepared by operatively linking a reporter gene with at 
least one transcriptional regulatory element. If only one transcriptional regulatory element is 
included it must be a regulatable promoter, At least one the selected transcriptional regulatory 
elements must be indirectly or directly regulated by the activity of the selected cell-surface 
receptor whereby activity of the receptor can be monitored via transcription of the reporter 
genes. 

The construct may contain additional transcriptional regulatory elements, such as a 
FIRE sequence, or other sequence, that is not necessarily regulated by the cell surface protein, 
but is selected for its ability to reduce background level transcription or to amplify the 
transduced signal and to thereby increase the sensitivity and reliability of the assay. 

Many reporter genes and transcriptional regulatory elements are known to those of 
skill in the art and others may be identified or synthesized by methods known to those of skill 
in the art. Reporter genes 

A reporter gene includes any gene that expresses a detectable gene product, which 
may be RNA or protein. Preferred reporter genes are those that are readily detectable. The 
reporter gene may also be included in the construct in the form of a fusion gene with a gene 
that includes desired transcriptional regulatory sequences or exhibits other desirable 
properties. 

Examples of reporter genes include, but are not limited to CAT (chloramphenicol 
acetyl transferase) (Alton and Vapnek (1979), Nature 282: 864-869) luciferase, and other 
enzyme detection systems, such as beta-galactosidase; firefly luciferase (deWet et al. (1987), 
Mol. Cell. Biol. 7:725-737); bacterial luciferase (Engebrecht and Silverman (1984), PNAS 1: 
4154-4158; Baldwin et al. (1984), Biochemistry 23: 3663-3667); alkaline phosphatase (Toh 
et al. (1989) Eur. J. Biochem. 182: 231-238, Hall et al. (1983) J. Mol. Appl. Gen. 2: 101), 
human placental secreted alkaline phosphatase (Cullen and Malim (1992) Methods in 


Enzymol. 216:362-368). 

Transcriptional control elements include, but are not limited to, promoters, enhancers, 
and repressor and activator binding sites. Suitable transcriptional regulatory elements may be 
derived from the transcriptional regulatory regions of genes whose expression is rapidly 
induced, generally within minutes, of contact between the cell surface protein and the effector 
protein that modulates the activity of the cell surface protein. Examples of such genes 
include, but are not limited to, the immediate early genes (see, Sheng et al. (1990) Neuron 4: 
477-485), such as c-fos, Immediate early genes are genes that are rapidly induced upon 
binding of a ligand to a cell surface protein. The transcriptional control elements that are 
preferred for use in the gene constructs include transcriptional control elements from 
immediate early genes, elements derived from other genes that exhibit some or all of the 
characteristics of the immediate early genes, or synthetic elements that are constructed such 
that genes in operative linkage therewith exhibit such characteristics. The characteristics of 
preferred genes from which the transcriptional control elements are derived include, but are 
not limited to, low or undetectable expression in quiescent cells, rapid induction at the 
transcriptional level within minutes of extracellular simulation, induction that is transient and 
independent of new protein synthesis, subsequent shut-off of transcription requires new 
protein synthesis, and mRNAs transcribed from these genes have a short half-life. It is not 
necessary for all of these properties to be present. 

In the most preferred constructs, the transcriptional regulatory elements are derived 
from the c-fos gene. 

The c-fos proto oncogene is the cellular homolog of the transforming gene of FBJ 
osteosarcoma virus. It encodes a nuclear protein that most likely involved in normal cellular 
growth and differentiation. Transcription of c-fos is transiently and rapidly activated by 
growth factors and by other inducers of other cell surface proteins, including hormones, 
differentiation-specific agents, stress, mitogens and other known inducers of cell surface 
proteins. Activation is protein synthesis independent. The c-fos regulatory elements include 
(see, Verma et al. (1987) Cell 51 : a TATA box that is required for transcription initiation; two 
upstream elements for basal transcription, and an enhancer, which includes an element with 
dyad symmetry and which is required for induction by TP A, serum, EGF, and PMA. 

The 20 bp transcriptional enhancer element located between - 317 and - 298 bp 
upstream from the c-fos mRNA cap site, which is essential for serum induction in serum 
starved NIH 3T3 cells. One of the two upstream elements is located at -63—57 and it 
resembles the consensus sequence for cAMP regulation. 

Other promoters and transcriptional control elements, in addition to those described 
above, include the vasoactive intestinal peptide (VIP) gene promoter (cAMP responsive; Fink 
et al. (1988), Proc. Natl. Acad. Sci. 85:6662-6666); the somatostatin gene promoter (cAMP 


responsive; Montminy et al. (1986), Proc. Natl. Acad. Sci. 8.3:6682-6686); the proenkephalin 
promoter (responsive to cAMP, nicotinic agonists, and phorbol esters; Comb et al. (1986), 
Nature 323:353-356); the phosphoenolpyruvate carboxy-kinase gene promoter (cAMP 
responsive; Short et al. (1986), J. Biol. Chem. 261:9721-9726); the NGFI-A gene promoter 
(responsive to NGF, cAMP, and serum; Changelian et al. (1989). Proc. Natl. Acad. Sci. 
86:377-381); and others that may be known to or prepared by those of skill in the art. 

In certain assays it may be desirable to use changes in growth in the screening 
procedure. For example, one of the consequences of activation of the pheromone signal 
pathway in wild-type yeast is growth arrest. If one is testing for an antagonist of a G 
protein-coupled receptor, this normal response of growth arrest can be used to select cells in 
which the pheromone response pathway is inhibited. That is, cells exposed to both a known 
agonist and a peptide of unknown activity will be growth arrested if the peptide is neutral or 
an agonist, but will grow normally if the peptide is an antagonist. Thus, the growth arrest 
response can be used to advantage to discover peptides that function as antagonists. 

However, when searching for peptides which can function as agonists of G 
protein-coupled receptors, or other pheromone system proteins, the growth arrest consequent 
to activation of the pheromone response pathway is an undesirable effect since cells that bind 
peptide agonists stop growing while surrounding cells that fail to bind peptides will continue 
to grow. The cells of interest, then, will be overgrown or their detection obscured by the 
background cells, confounding identification of the cells of interest. To overcome this 
problem the present invention teaches engineering the cell such that: 1) growth arrest does 
not occur as a result of exogenous signal pathway activation (e.g., by inactivating the FAR1 
gene); and/or 2) a selective growth advantage is conferred by activating the pathway (e.g., by 
transforming an auxotrophic mutant with a HIS3 gene under the control of a pheromone- 
responsive promoter, and applying selective conditions). 

It is, of course, desirable that the exogenous receptor be exposed on a continuing basis 
to the peptides. Unfortunately, this is likely to result in desensitization of the pheromone 
pathway to the stimulus. For example, the mating signal transduction pahtway is known to 
become desensitized by several mechanisms including pheromone degradation and 
modification of the function of the receptor, G proteins,s and/or downstream elements of the 
pheromone signal transduction by the products of the SST2, STE50, AFR1 (Konopka, J.B. 
(1993) Mol Cell Biol 13:6876-6888) and SGV1, MSG5, and SIG1 genes. Selected 
mutations in these genes can lead to hypersensitivity to pheromone and an inability to adapt 
to the presence of pheromone. For example, introduction of mutations that interfere with 
function into strains expressing heterologous G protein-coupled receptors constitutes a 
significant improvement on wild type strains and enables the development of extremely 
sensitive bioassays for compounds that interact with the receptors. Other mutations e.g. 


STE50, sgvl,barl, ste2,ste3,pikl,msg5, sigl, and aftl, have the similar effect of increasing 
the sensitivity of the bioassay. Thus desensitization may be avoided by mutating (which may 
include deleting) the SST2 gene so that it no longer produces a functional protein, or by 
mutating one of the other genes listed above. 

If the endogenous homolog of the receptor is produced by the yeast cell, the assay will 
not be able to distinguish between peptides which interact with the endogenous receptor and 
those which interact with the exogenous receptor. It is therefore desirable that the endogenous 
gene be deleted or otherwise rendered nonfunctional. 

In the case of receptors which modulate cyclic AMP, a transcriptional based readout 
can be constructed using the cyclic AMP response element binding protein, CREB, which is 
a transcription factor whose activity is regulated by phosphorylation at a particular serine 
(SI 33). When this serine residue is phosphorylated, CREB binds to a recognition sequence 
known as a CRE (cAMP Responsive Element) found to the 5* of promotors known to be 
responsive to elevated cAMP levels. Upon binding of phosphorylated CREB to a CRE, 
transcription from this promoter is increased. 

Phosphorylation of CREB is seen in response to both increased cAMP levels and 
increased intracellular Ca levels. Increased cAMP levels result in activation of PKA, which 
in turn phosphorylates CREB and leads to binding to CRE and transcriptional activation. 
Increased intracellular calcium levels results in activation of calcium/calmodulin responsive 
kinase IV (CaM kinase IV). Phosphorylation of CREB by CaM kinase IV is effectively the 
same as phosphorylation of CREB by PKA, and results in transcriptional activation of CRE 
containing promotors. 

Therefore, a transcriptional-based readout can be constructed in cells containing a 
reporter gene whose expression is driven by a basal promoter containing one or more CRE. 
Changes in the intracellular concentration of Ca++ (a result of alterations in the activity of the 
receptor upon engagement with a ligand) will result in changes in the level of expression of 
the reporter gene if: a) CREB is also co-expressed in the cell, and b) either the endogenous 
yeast CaM kinase will phosphorylate CREB in response to increases in calcium or if an 
exogenously expressed CaM kinase IV is present in the same cell. In other words, 
stimulation of PLC activity will result in phosphorylation of CREB and increased 
transcription from the CRE-construct, while inhibition of PLC activity will result in 
decreased transcription from the CRE-responsive construct. 

As described in Bonni et al. (1993) Science 262:1575-1579, the observation that 
CNTF treatment of SK-N-MC cells leads to the enhanced interaction of STAT/p91 and 
STAT related proteins with specific DNA sequences suggested that these proteins might be 
key regulators of changes in gene expression that are triggered by CNTF. Consistent with this 
possibility is the finding that DNA sequence elements similar to the consensus DNA 
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sequence required for STAT/p91 binding are present upstream of a number of genes 
previously found to be induced by CNTF (e.g., Human c-fos, Mouse c-fos, Mouse tisl 1, Rat 
junB, Rat SOD-1, and CNTF). Those authors demonstrated the ability of STAT/p91 binding 
sites to confer CNTF responsiveness to a non-responsive reporter gene. Accordingly, a 
reporter construct for use in the present invention for detecting signal transduction through 
STAT proteins, such as from cytokine receptors, can be generated by using -71 to +109 of the 
mouse c-fos gene fused to the bacterial chloramphenicol acetyltransferase gene (-71fosCAT) 
or other detectable marker gene. Induction by a cytokine receptor induces the tyrosine 
phosphorylation of STAT and STAT-related proteins, with subsequent translocation and 
binding of these proteins to the STAT-RE. This then leads to activation of transcription of 
genes containing this DNA element within their promoters. 

In preferred embodiments, the reporter gene is a gene whose expression causes a 
phenotypic change which is screenable or selectable. If the change is selectable, the 
phenotypic change creates a difference in the growth or survival rate between cells which 
express the reporter gene and those which do not. If the change is screenable, the phenotype 
change creates a difference in some detectable characteristic of the cells, by which the cells 
which express the marker may be distinguished from those which do not. Selection is 
preferable to screening in that it can provide a means for amplifying from the cell culture 
those cells which express a test polypeptide which is a receptor effector. 

The marker gene is coupled to the receptor signaling pathway so that expression of 
the marker gene is dependent on activation of the receptor. This coupling may be achieved by 
operably linking the marker gene to a receptor-responsive promoter. The term 
"receptor-responsive promoter" indicates a promoter which is regulated by some product of 
the target receptor's signal transduction pathway. 

Alternatively, the promoter may be one which is repressed by the receptor pathway, 
thereby preventing expression of a product which is deleterious to the cell. With a receptor 
repressed promoter, one screens for agonists by linking the promoter to a deleterious gene, 
and for antagonists, by linking it to a beneficial gene. Repression may be achieved by 
operably linking a receptor- induced promoter to a gene encoding mRNA which is antisense 
to at least a portion of the mRNA encoded by the marker gene (whether in the coding or 
flanking regions), so as to inhibit translation of that mRNA. Repression may also be obtained 
by linking a receptor-induced promoter to a gene encoding a DNA binding repressor protein, 
and incorporating a suitable operator site into the promoter or other suitable region of the 
marker ^gene. 

In the case of yeast, suitable positively selectable (beneficial) genes include the 
following: URA3, LYS2, HIS3, LEU2, TRP1; ADEJ, 2,3,4,5, 7,8; ARGl, 3, 4, 5, 6, 8; HIS1, 4, 
5; ILV1, 2, 5; THR1, 4; TRP2, 3, 4, 5; LEVI, 4; MET2,3,4,8,9,14,16,19; URA1, 2,4,5, 10; 


H0M3,6; ASP3; CHOI; ARO 2, 7; CYS3; OLE1; IN01,2 t 4; PR01,3 Countless other genes are 
potential selective markers. The above are involved in well-characterized biosynthetic 
pathways. The imidazoleglycerol phosphate dehydratase (IGP dehydratase) gene (HIS3 ) is 
preferred because it is both quite sensitive and can be selected over a broad range of 
expression levels. In the simplest case, the cell is auxotrophic for histidine (requires histidine 
for growth) in the absence of activation. Activation leads to synthesis of the enzyme and the 
cell becomes prototrophic for histidine (does not require histidine). Thus the selection is for 
growth in the absence of histidine. Since only a few molecules per cell of IGP dehydratase 
are required for histidine prototrophy, the assay is very sensitive. 

In a more complex version of the assay, cells can be selected for resistance to 
aminotriazole (AT), a drug that inhibits the activity of IGP dehydratase. Cells with low, fixed 
level of expression of HIS3 are sensitive to the drug, while cells with higher levels are 
resistant. The amount of AT can be selected to inhibit cells with a basal level of HI S3 
expression (whatever that level is) but allow growth of cells with an induced level of 
expression. In this case selection is for growth in the absence of histidine and in the presence 
of a suitable level of AT. 

In appropriate assays, so-called counterselectable or negatively selectable genes may 
be used. Suitable genes include: URA3 (orotidine-S'-phosphate decarboxylase; inhibits 
growth on 5-fluoroorotic acid), LYS2 (2-aminoadipate reductase; inhibits growth on 
cc-aminoadipate as sole nitrogen source), CYH2 (encodes ribosomal protein L29; 
cycloheximide-sensitive allele is dominant to resistant allele), CAN1 (encodes arginine 
permease; null allele confers resistance to the arginine analog canavanin), and other 
recessive drug-resistant markers. 

In one example, the marker gene effects yeast cell growth. The natural response to 
signal transduction via the yeast pheromone system response pathway is for cells to undergo 
growth arrest. This is the preferred way to select for antagonists to a ligand/receptor pair that 
induces the pathway. An autocrine peptide antagonist would inhibit the activation of the 
pathway; hence, the cell would be able to grow. Thus, the FAR1 gene may be considered an 
endogenous counterselectable marker. The FAR1 gene is preferably inactivated when 
screening for agonist activity. 

The marker gene may also be a screenable gene. The screened characteristic may be a 
change in cell morphology, metabolism or other screenable features. Suitable markers 
include beta-galactosidase (Xgal, C12FDG, Salmon-gal, Magenta-Gal (latter two from 
Biosyntfr Ag)), alkaline phosphatase, horseradish peroxidase, exo-glucanase (product of yeast 
exbl gene; nonessential, secreted); luciferase; bacterial green fluorescent protein; (human 
placental) secreted alkaline phosphatase (SEAP); and chloramphenicol transferase (CAT). 
Some of the above can be engineered so that they are secreted (although not P-galactosidase). 
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A preferred screenable marker gene is beta-galactosidase; yeast cells expressing the enzyme 
convert the colorless substrate Xgal into a blue pigment. Again, the promoter may be 
receptor-induced or receptor-inhibited. 

XIII Genetic Markers in Yeast Strains 

Yeast strains that are auxotrophic for histidine (HIS3) are known, see Struhl and Hill, 
(1987) Mol Cell Biol, 7:104; Fasullo and Davis, Mol Cell Biol, (1988) 8:4370. The HIS3 
(imidazoleglycerol phosphate dehydratase) gene has been used as a selective marker in yeast. 
See Sikorski and Heiter, (1989) Genetics, 122:19; Struhl, et al., P.N.A.S. (1979) 76:1035; 
and, for FUSl-HIS3 fusions, see Stevenson, et aL, (1992) Genes Dev., 6:1293. 

IXX. Novel FPRL-1 ligand 

Yet another aspect of the invention pertains to a novel ligand for the orphan receptor, 
FPRL-1. As described in Example 8, a tridecapeptide having the sequence Ser-Leu-Leu-Trp- 
Leu-Thr-Cys-Arg-Pro-Trp-Glu-Ala-Met was identified from a polypeptide library on the 
basis of its ability to act as a surrogate ligand for FPRL-1 . 

Chemoattractants are important mediators of inflammation, they function to recruit 
phagocytic cells at the site of injury or infection. They also mediate granule secretion, 
superoxide generation and upregulation of cell surface adhestion molecules, for example 
MAC-1. Exemplary chemoattractants include the complement component C5a, interleukin 8, 
leukotriene B4 and platelet activiating factor. Many of these substances participate in 
pathophysiological conditions such as anaphylaxis and septic shock. The identification of 
ligands for the orphan FPRL1 receptor provides new opportunities for discovery of receptor 
agonists, that could potentially serve to enhance lymphocyte recruitment in 
immunocompromised patients, and for the discovery of receptor antagonists (described 
supra) that could prevent undesirable consequences of immune activation such as 
anaphylactic or septic shock. 

The term "peptide" is used herein to refer to a chain of two or more amino acids or 
amino acid analogs (including non-naturally occurring amino acids), with adjacent amino 
acids joined by peptide (-NHCO-) bonds. Thus, the peptides of the present invention include 
oligopeptides, polypeptides, and proteins. Preferably, the peptides of the present invention 
include all or a portion of the S-L-L-W-L-T-C-R-P-W-E-A-M peptide, or a homolog thereof. 
The peptide (or peptidomimetic) is preferably at least 3 amino acid residues in length, though 
peptides of 4, 5, 7, 10, 13 or more residues in length are contemplated. For example, the 
sequence derived from the FPRL-1 surrogate ligand can be provided as part of a fusion 
protein. The minimum peptide length is chiefly dictated by the need to obtain sufficient 



potency as an activator or inhibitor. Given the size of the peptide isolated in subject assay, 
smaller fragments of the tridecapeptide which retain receptor binding activity will be easily 
identified, e.g., by chemical synthesis of different fragments. The maximum peptide length 
will only be a function of synthetic convenience once an active peptide is identified. 

The invention also provides for the generation of mimetics, e.g. peptide or non- 
peptide agents. Moreover, the present invention also contemplates variants of the subject 
polypeptide which may themselves be either agonistic or antagonistic of the S-L-L-W-L-T- 
C-R-P- W-E-A-M peptide. Thus, using such mutagenic techniques as known in the art, the 
determinants of S-L-L- W-L-T-C-R-P- W-E-A-M polypeptide which participate in FPRL-1 
interactions can be ellucidated. To illustrate, the critical residues of a subject polypeptide 
which are involved in molecular recognition of an FPRL-1 receptor can be determined and 
used to generate variant polypeptides which competitively inhibit binding of the authentic S- 
L-L-W-L-T-C-R-P- W-E-A-M peptide with that receptor. By employing, for example, 
scanning mutagenesis to map the amino acid residues of the polypeptide involved in binding 
the FPRL-1 receptor, peptide and peptidomimetic compounds can be generated which mimic 
those residues in binding to the receptor and which consequently can inhibit binding of an 
authentic ligand for the FPRL-1 receptor and interfere with the function of that receptor. 

Moreover, as is apparent from the present and parent disclosures, mimetopes of the 
subject S-L-L-W-L-T-C-R-P- W-E-A-M peptide can be provided as non-hydrolyzable peptide 
analogs. For illustrative purposes, peptide analogs of the present invention can be generated 
using, for example, benzodiazepines (e.g., see Freidinger et al. in Peptides: Chemistry and 
Biology, G.R. Marshall ed., ESCOM Publisher: Leiden, Netherlands, 1988), substituted 
gama lactam rings (Garvey et al. in Peptides: Chemistry and Biology, G.R. Marshall ed., 
ESCOM Publisher: Leiden, Netherlands, 1988, pi 23), C-7 mimics (Huffman et al. in 
Peptides: Chemistry and Biologyy, G.R. Marshall ed., ESCOM Publisher: Leiden, 
Netherlands, 1988, p. 105), keto-methylene pseudopeptides (Ewenson et al. (1986) J Med 
Chem 29:295; and Ewenson et al. in Peptides: Structure and Function (Proceedings of the 9th 
American Peptide Symposium) Pierce Chemical Co. Rockland, IL, 1985), P-turn dipeptide 
cores (Nagai et al. (1985) Tetrahedron Lett 26:647; and Sato et al. (1986) J Chem Soc Perkin 
Trans 1:1231), p-aminoalcohols (Gordon et al. (1985) Biochem Biophys Res 
Communl26:419; and Dann et al. (1986) Biochem Biophys Res Commun 134:71), 
diaminoketones (Natarajan et al. (1984) Biochem Biophys Res Commun 124:141), and 
methyleneamino-modifed (Roark et al. in Peptides: Chemistry and Biology, G.R. Marshall 
ed„ ESCOM Publisher: Leiden, Netherlands, 1988, pl34). Also, see generally, Session III: 
Analytic" and synthetic methods, in in Peptides: Chemistry and Biology, G.R. Marshall ed., 
ESCOM Publisher: Leiden, Netherlands, 1988) 

In an exemplary embodiment, the peptidomimetic can be derived as a retro-inverso 
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analog of the peptide. To illustrate, the S-L-L-W-L-T-C-R-P-W-E-A-M peptide can be 
generated as the retro-inverso analog: 


Such retro-inverso analogs can be made according to the methods known in the art, 
such as that described by the Sisto et aL U.S. Patent 4,522,752. For example, the illustrated 
retro-inverso analog can be generated as follows. The geminal diamine corresponding to the 
serine analog is synthesized by treating a protected serine with ammonia under HOBT-DCC 
coupling conditions to yield the N-Boc amide, and then effecting a Hofmann-type 
rearrangement with I,I-bis-(trifluoroacetoxy)iodobenzene (TIB), as described in 
Radhakrishna et al. (1979) J. Org. Chem. 44:1746. The product amine salt is then coupled to 
a side-chain protected (e.g., as the benzyl ester) N-Fmoc D-Leu residue under standard 
conditions to yield the pseudodipeptide. The Fmoc (fluorenylmethoxycarbonyl) group is 
removed with piperidine in dimethylformamide, and the resulting amine is trimethylsilylated 
with bistrimethylsilylacetamide (BSA) before condensation with suitably alkylated, side- 
chain protected derivative of Meldrum's acid, as described in U.S. Patent 5,061,81 1 to Pinori 
et al., to yield the retro-inverso tripeptide analog S-L-L. The pseudotripeptide is then 
coupled with L-Trp under standard conditions to give the protected tetrapeptide analog. The 
protecting groups are removed to release the product, and the steps repeated to enlogate the 
tetrapeptide to the full length peptide. It will be understood that a mixed peptide, e.g. 
including some normal peptide linkages, can be generated. As a general guide, sites which 
are most susceptible to proteolysis are typically altered, with less susceptible amide linkages 
being optional for mimetic switching The final product, or intermediates thereof, can be 
purified by HPLC. 

In another illustrative embodiment, the peptidomimetic can be derived as a retro- 
enatio analog of the peptide, such as the exemplary retro-enatio peptide analog derived for 
the illustrative S-L-L-W-L-T-C-R-P-W-E-A-M peptide: 




NH 3 -(d) Met-(d) Aia-(d) Glu-(d)Trp.... (d) Trp- (d) Leu-(d)-Leu-(d) Ser 

Retro-enantio analogs such as this can be synthesized using commercially available 
D-amino acids and standard, solid- or solution-phase peptide-synthesis techniques. For 
?=i example, in a preferred solid-phase synthesis method, a suitably amino-protected (t- 
sB butyloxycarbonyl, Boc) D-Serine residue (or analog thereof) is covalently bound to a solid 
J support such as chloromethyl resin. The resin is washed with dichloromethane (DCM), and 
SJ the BOC protecting group removed by treatment with TFA in DCM. The resin is washed and 
/z neutralized, and the next Boc-protected D-amino acid (D-Leu) is introduced by coupling with 
diisopropylcarbodiimide. The resin is again washed, and the cycle repeated for each of the 
5 remaining amino acids in turn (D-Leu, D-Trp etc). When synthesis of the protected retro- 
j=s ! enantio peptide is complete, the protecting groups are removed and the peptide cleaved from 
fU the solid support by treatment with hydrofluoric acid/anisole/dimethyl sulfide/thioanisole. 
The final product is purified by HPLC to yield the pure retro-enantio analog. 

Q In still another illustrative embodiment, trans-olefin derivatives can be made for the 

subject polypeptide. For example, an exemplary olefin analog is derived for the illustrative 
S-L-L-W-L-T-C-R-P-W-E-A-M peptide: 



The trans olefin analog of the subject peptide can be synthesized according to the method of 


Y.K. Shue et al. (1987) Tetrahedron Letters 28:3225. 

Still another class of peptidomimetic derivatives include the phosphonate derivatives, 
such as the partially phosphonate derivatived S-L-L-W-L-T-C-R-P-W-E-A-M peptide: 


The synthesis of such phosphonate derivatives can be adapted from known synthesis 
schemes. See, for example, Loots et al. in Peptides: Chemistry and Biology, (Escom Science 
Publishers, Leiden, 1988, p. 118); Petrillo et al. in Peptides: Structure and Function 
(Proceedings of the 9th American Peptide Symposium, Pierce Chemical Co. Rockland, IL, 
1985). 


The invention now being generally described will be more readily understood by 
reference to the following examples, which are included merely for purposes of illustration of 
certain aspects and embodiments of the present invention and are not intended to limit' the 
invention. 

Example 1: Development of Autocrine Yeast Strains 

In this example, we describe a pilot experiment in which haploid cells were 
engineered to be responsive to their own pheromones. (Note that in the examples, functional 
genes are capitalized and inactivated genes are in lower case.) For this purpose we 
constniQted recombinant DNA molecules designed to: 

i. place the coding region of STE2 under the transcriptional control of elements 
which normally direct the transcription of STE3, This is done in a plasmid that allows the 
replacement of genomic STE3 of S. cerevisiae with sequences wherein the coding sequence 
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of STE2 is driven by STE3 transcriptional control elements. 

ii. place the coding region of STE3 under the transcriptional control of elements 
which normally direct the transcription of STE2. This is done in a plasmid which will allow 
the replacement of genomic STE2 of S. cere visiae with sequences wherein the coding 
sequence of STE3 is driven by STE2 transcriptional control elements. 

The sequence of the STE2 gene is known see Burkholder A.C. and Hartwell L.H. 
(1985), Nuc. Acids Res. 13, 8463; Nakayama N., Miyajima A., Arai K. (1985) EMBO J. 4, 
2643. 

A 4.3 kb BamHI fragment that contains the entire STE2 gene was excised from 
plasmid YEp24-STE2 (obtained from J. Thorner, Univ. of California) and cloned into 
pALTER (Protocols and Applications Guide, 1991, Promega Corporation, Madison, WI). 
An Spel site was introduced 7 nucleotides (nts) upstream of the ATG of STE2 with the 
following mutagenic oligonucleotide, using the STE2 minus strand as template: 


£i A second Spel site was simultaneously introduced just downstream of the STE2 stop 

%! codon with the following mutagenic oligonucleotide: 

Cj 5 f -TG ATC A AAATTT ACTAQTTTG AAAAAGTAATTTCG 3' 

The BamHI fragment of the resulting plasmid (Cadus 1096) containing STE2 with 
M Spel sites immediately flanking the coding region, was then subcloned into the yeast 
p^J integrating vector YIpl9 to yield Cadus 1 143. 

j!r The STE3 sequence is also known (Nakayama N., Miyajima A., Arai K. (1985), 

g EMBO J. 4, 2643; (Hagen D.C., McCaffrey G., Sprague G.F. (1986), Proc. Natl Acad. Sci. 
83, 1418. STE3 was made available by Dr. J. Broach as a 3.1 kb fragment cloned into 
pBLUESCRIPT-KS II (Stratagene, 11011 North Torrey Pines Road, La Jolla, C A 92037). 
STE3 was subcloned as a KpnI-Xbal fragment into both M13mpl8 RF (to yield Cadus 1 105 
and pUC19 (to yield Cadus 1107). The two Spel sites in Cadus 1107 were removed by 
digestion with Spel, fill-in with DNA polymerase I Klenow fragment, and recircularization 
by blunt-end ligation. Single-stranded DNA containing the minus strand of STE3 was 
obtained using Cadus 1 105 and Spel sites were introduced 9 nts upstream of the start codon 
and 3 nts downstream of the stop codon of STE3 with the following mutagenic 
oligonucleotides, respectively: 


The mutagenesis was accomplished using the T7-GEN protocol of United States 
Biochemical (T7-GEN In Vitro Mutagenesis Kit, Descriptions and Protocols, 1991, United 


S'-GTTAAGAACCATATACTAGTATCAAAAATGTCTG 3' 



■GGCAAAATACTAQIAAAATTTTCATGTC 3' 


GGCCCTTAACACACIAGTGTCGCATTATATTTAC 3' 


States Biochemical, P.O. Box 22400, Cleveland, Ohio 44122). The replicative form of the 
resulting Cadus 1141 was digested with Aflll and Kpnl, and the approximately 2 kb fragment 
containing the entire coding region of STE3 flanked by the two newly introduced Spe I sites 
was isolated and ligated with the approximately 3.7 kb vector fragment of Aflll- and Kpnl- 
digested Cadus 1107, to yield Cadus 1138. Cadus 1138 was then digested with Xbal and 
Kpnl, and the STE3-containing 2.8 kb fragment was ligated into the Xbal- and KpnI-digested 
yeast integrating plasmid pRS406 (Sikorski, R.S. and Hieter, P. (1989) , Genetics 122:19-27) 
to yield Cadus 1 145. 

The Spel fragment of Cadus 1143 was replaced with the Spel fragment of Cadus 
1 145 to yield Cadus 1147, in which the coding sequences of STE3 are under the control of 
STE2 expression elements. Similarly, the Spel fragment of Cadus 1 145 was replaced with 
the Spel fragment of Cadus 1143 to yield Cadus 1148, in which the coding sequences of 
STE2 are under the control of STE3 expression elements. Using the method of pop-in/pop- 
out replacement (Rothstein, R. (1991) Methods in Enzymology, 194:281 301), Cadus 1147 
was used to replace genomic STE2 with the ste2-STE3 hybrid in a MATa cell and Cadus 
1 148 was used to replace genomic STE3 with the ste3-STE2 hybrid in a MATa cell. Cadus 
1 147 and 1 148 contain the selectable marker URA3. 

Haploid yeast of mating type a which had been engineered to express HIS3 under the 
control of the pheromone-inducible FUS1 promoter were transformed with CADUS 1147, 
and transformants expressing URA3 were selected. These transformants, which express both 
Ste2p and Ste3p, were plated on 5-fluoroorotic acid to allow the selection of clones which 
had lost the endogenous STE2, leaving in its place the heterologous, integrated STE3. Such 
cells exhibited the ability to grow on media deficient in histidine, indicating autocrine 
stimulation of the pheromone response pathway. 

Similarly, haploids of mating type a that can express HIS2_ under the control of the 
pheromone-inducible FUS1 promoter were transformed with CADUS 1 148 and selected for 
replacement of their endogenous STE3 with the integrated STE2. Such cells showed, by 
their ability to grow on histidine-deficient media, autocrine stimulation of the pheromone 
response pathway. 

Example 2: Strain Development 

In this example, yeast strains are constructed which will facilitate selection of clones 
which exhibit autocrine activation of the pheromone response pathway. To construct 
appropriate yeast strains, we will use: the YIp-STE3 and pRS-STE2 knockout plasmids 
described above, plasmids available for the knockout of FAR 1, SST2, and HIS3, and mutant 
strains that are commonly available in the research community. The following haploid 
strains will be constructed, using one-step or two-step knockout protocols described in Meth. 


Enzymol 194:281-301, 1991: 


1. 

2. 
3. 
4. 

5. 
6. 


MATa 
MA To. 
MATa 
MATa 
MATa 
MATa 


ste3::STE2::ste3 farl sst2 FUS1::HIS3 

ste2::STE3::ste2 farl sst2 FUS1::HIS3 

ste3::STE2::ste3 farl sst2 mfal mfa2 

ste2::STE3::ste2 farl sst2 mfal mfa2 


FUS1::HIS3 
FUS1::HIS3 


adel leu2 


barl farl-1 fusl~HIS3 stel4::TRPl ura3 trpl leu2 his3 
mfal mfa2 farl-1 his3::fusl-HIS3 ste2-STE3 ura3 metl 


Strains 1 and 2 will be tested for their ability to grow on histidine-deficient media as a 
result of autocrine stimulation of their pheromone response pathways by the pheromones 
which they secrete. If these tests prove successful, strain 1 will be modified to inactivate 
endogenous MFolI and MFa2. The resulting strain 3, MATa farl sst2 ste3::STE2::ste3 
FUS1::HIS3 mfal mfa2, should no longer display the selectable phenotype (i.e., the strain 
should be auxotrophic for histidine). Similarly, strain 2 will be modified to inactivate 
endogenous MFal and MFa2. The resulting strain 4, MATa farl sst2 ste2::STE3::ste2 
FUS1::HIS3 mfal mfa2, should be auxotrophic for histidine. The uses of strains 5 and 6 are 
outlined in Examples 3 and 4 below. 

Example 3: Peptide Library 

In this example, a synthetic oligonucleotide encoding a peptide is expressed so that 
the peptide is secreted or transported into the periplasm. 

i. The region of MFal which encodes mature a-factor has been replaced via 
single-stranded mutagenesis with restriction sites that can accept oligonucleotides with Aflll 
and Bglll ends. Insertion of oligonucleotides with Aflll and Bglll ends will yield plasmids 
which encode proteins containing the MFal signal and leader sequences upstream of the 
sequence encoded by the oligonucleotides. The MFal signal and leader sequences should 
direct the processing of these precursor proteins through the pathway normally used for the 
transport of mature a-factor. 

The MFal gene, obtained as a 1.8 kb EcoRI fragment from pDA6300 (J. Thorner, 
Univ. of California) was cloned into pALTER in preparation for oligonucleotide-directed 
mutagenesis to remove the coding region of mature a-factor while constructing sites for 
acceptance of oligonucleotides with Aflll and Bell ends. The mutagenesis was accomplished 
using the minus strand as template and the following mutagenic oligonucleotide: 


S'-CTAAAGAAGAAGGGGTATCTTTGCTTAAGCTCGAGATCTCGACTGATA- 



ACAACAGTGTAG-3* 

A Hindlll site was simultaneously introduced 7 nts upstream of the MFal start codon 
with the oligonucleotide: 

5' C ATAC AC AATATAAAGCTTTAAAAGAATG AG-3 ' 

The resulting plasmid, Cadus 1214, contains a Hindlll site 7 nts upstream of the MFa 
1 initiation codon, an Aflll site at the positions which encode the KEX2 processing site in the 
MFal leader peptide, and Xhol and Bglll sites in place of all sequences from the leader- 
encoding sequences up to and including the normal stop codon. The 1.5 kb Hindlll fragment 
of Cadus 1214 therefore provides a cloning site for oligonucleotides to be expressed in yeast 
and secreted through the pathway normally travelled by endogenous a-factor. 

A sequence comprising the ADCl promoter and 5' flanking sequence was obtained as 
a L5 kb BamHI-Hindlll fragment from pAAH5 (Ammerer, G. (1983) Academic Press, Inc., 
Meth. Enzymol. 101, 192-201 and ligated into the high copy yeast plasmid pRS426 
(Christianson, T.W et al. (1992) Gene 1 10:1 19-122) (see Figure 1). The unique Xhol site in 
the resulting plasmid was eliminated to yield Cadus 1 186. The 1.5 Kb Hindlll fragment of 
Cadus 1214 was inserted into Hindlll-digested Cadus 1186; expression of sequences cloned 
into this cassette initiates from the ADH1 promoter. The resulting plasmid, designated Cadus 
1215, can be prepared to accept oligonucleotides with Aflll and Bell ends by digestion with 
those restriction endonucleases. The oligonucleotides will be expressed in the context of MF 
al signal and leader peptides (Figure 2). 

Modified versions of Cadus 1215 were also constructed. To 30 improve the 
efficiency of ligation of oligonucleotides into the expression vector, Cadus 1215 was 
restricted with Kpnl and religated to yield Cadus 1337. This resulted in removal of one of 
two Hindlll sites. Cadus 1337 was linearized with Hindlll, filled-in, and recircularized to 
generate Cadus 1338. To further tailor the vector for library construction, the following 
double-stranded oligonucleotide was cloned into Aflll-and Bglll-digested Cadus 1338: 

5' TTAAGCGTGAGGCAGAAGCTTATCGATA oligo 062 

3' CGCACTCCGTCTTCGAATAGCTATCTAG oligo 063 

The Clal site is unique in the resulting vector, Cadus 1373. In Cadus 1373, the 
Hindlll site that exists at the junction between the MFa pro sequence and the mature peptide 
to be expressed by this vector was made unique. Therefore the Hindlll site and the 
downstream Bglll site can be used to insert oligo-nucleotides encoding peptides of interest. 
These modifications of Cadus 1215 provide an laternative to the use of the Aflll site in the 
cloning of oligonucleotides into the expressions vector. 


Cadus 1373 was altered further to permit elimination from restricted vector 
preparations of contaminating singly-cut plasmid. Such contamination could result in 
unacceptably high background transformation. To eliminate this possibility, approximately 
1.1 kb of dispensable ADH1 sequence at the 5' side of the promoter region was deleted. This 
was accomplished by restruction of Cadus 1373 with SphI and BamHI, fill-in, and ligation; 
this maneuver regenerates the BamHI site. The resulting vector, Cadus 1624, was then 
restricted with Hindlll and Clal and an approximately 1.4 kb Hindlll and Clal fragment 
encoding 25 lacZ was inserted to generate Cadus 1625. Use of Hindlll- and Bglll-restricted 
Cadus 1625 for acceptance of oligonucleotides results in a low background upon 
transformation of the ligation product into bacteria. 

Two single-stranded oligonucleotide sequences (see below) are synthesized, 
annealed, and repetitively filled in, denatured, and reannealed to form double-stranded 
oligonucleotides that, when digested with Aflll and Bell, can be ligated into the polylinker of 
the expression vector, Cadus 1215. The two single-stranded oligonucleotides have the 
following sequences: 

5*-G CTA CTT AAG CGT GAG GCA GAA GCT 3' and 

5'-C GGA TGA TCA (NNN) n AGC TTC TGC CTC ACG CTT AAG TAG C 3* 

where N is any chosen nucleotide and n is any chosen integer. Yeast transformed with the 
resulting plasmids will secrete — through the ct-factor secretory pathway ~ peptides whose 
amino acid sequence is determined by the particular choice of N and n). 

Alternatively, the following single stranded oligonucleotides are used: 
MFaNNK (76 mer) : 

5 f CT£GAIGCGAAGACAGCTNNKNNK^ 
NNK TGATCAGTCTGTGACGC 3' 

and MFaMbo (17 mer) : 

5' GCGTCACAGACTGATCA 3' 

When annealed the double stranded region is: 

TGA 7C AGTCTGTGACGC 

ACraqTCAGACACTGCG 

After fill-in using Taq DNA polymerase (Promega Corporation, Madison, 
Wisconsin), the double stranded product is restricted with Bbsl and Mbol and ligated to 
Hindlll- and Bglll-restricted Cadus 1373. 


ii. The region of MFal which encodes mature a-factor will be replaced via single 
stranded mutagenesis with restriction sites that can accept oligonucleotides with Xhol and 
Aflll ends. Insertion of oligonucleotides with Xhol and Aflll ends will yield plasmids which 
encode proteins containing the MFal leader sequences upstream of the sequence encoded by 
the oligonucleotides. The MFal leader sequences should direct the processing of these 
precursor proteins through the pathway normally used for the transport of mature a-factor. 

MFA1, obtained as a BamHI fragment from pKKl (provided by J. 30 Thorner and K. 
Kuchler), was ligated into the BamHI site of p ALTER (Promega). Using the minus strand of 
MFA1 as template, a Hindlll site was inserted by oligonucleo tide-directed mutagenesis just 5' 
to the MFA1 start codon using the following oligonucleotide: 

5' CCAAAATAAGTACAAAGCTTTCGAATAGAAATGCAACCATC 

A second oligonucleotide was used simultaneously to introduce a short polylinker for 
later cloning of synthetic oligonucleotides in place of MFA1 sequences. These MFA1 
sequences encode the C-terminal 5 amino acids of the 21 amino acid leader peptide through 
to the stop codon: 

S'GCCGCTCCAAAAGAAAAGACCTCGAGCTCGCTTAAGTTCTGCGTACAAAAACG- 
TTGTTC 3* 

The 1.6 kb Hindlll fragment of the resulting plasmid, Cadus 1172, contains 
sequences encoding the MFA1 start codon and the N-terminal 16 amino acids of the leader 
peptide, followed by a short polylinker containing Xhol, SacI, and Aflll sites for insertion of 
oligonucleotides. The 1.6 kb Hindlll fragment of Cadus 1172 was ligated into Hindlll- 
digested Cadus 1186 (see above) to place expression of sequences cloned into this cassette 
under the control of the ADH1 promoter. The SacI site in the polylinker was made unique by 
eliminating a second SacI site present in the vector. The resulting plasmid, designated Cadus 
1239, can be prepared to accept oligonucleotides with Xhol and Aflll ends by digestion with 
those restriction endonucleases for expression in the context of MFal leader peptides (Figure 
3). 

Two single-stranded oligonucleotide sequences (see below) are synthesized, annealed, and 
repetitively filled in, denatured, and reannealed to form double-stranded oligonucleotides 
that, when digested with Aflll and Bglll, can be cloned into the polylinker of the expression 
vector, Cadus 1239. The two single-stranded oligonucleotides used for the cloning have the 
following sequences: 

5* GG TAC TCG AGT GAA AAG AAG GAC AAC 3' 

5' CG TAC TTA AGC AAT AAC ACA (NNN) n GTT GTC CTT CTT TTC ACT CGA 
GTA CC 3\ 


where N is any chosen nucleotide and n is any chosen integer. 

Yeast transformed with the resulting plasmids will transport -- through the pathway normally 
used for the export of a-factor - farnesylated, carboxymethylated peptides whose amino acid 
sequence is determined by the particular choice of N and n (Figure 3). 

Example 4. Peptide Secretion/Transport. 

This example demonstrates the ability to engineer yeast such that they secrete or 
transport oligonucleotide-encoded peptides (in this case their pheromones) through the 
pathways normally used for the secretion or transport of endogenous pheromones. 

Autocrine MATa strain CY588: 

A MATa strain designed for the expression of peptides in the context of MFal (i.e., 
using the MFal expression vector, Cadus 1215) has been constructed. The genotype of this 
strain, which we designate CY588, is MATa bar J far 1-1 fusl-HIS3 stel4::TRPl ura3 
trpl leu2 his3. The bar! mutation eliminates the strain's ability to produce a protease that 
degrades a-factor and that may degrade some peptides encoded by the cloned 
oligonucleotides; ihefarl mutation abrogates the arrest of growth which normally follows 
stimulation of the pheromone response pathway; an integrated FUS1-HIS3 hybrid gene 
provides a selectable signal of activation of the pheromone response pathway; and, finally, 
the stel4 mutation lowers background of the FUS1-HIS3 readout. The enzymes responsible 
for processing of the MFal precursor in MATa cells are also expressed in MATa cells 
(Sprague and Thorner, in The Molecular and Cellular Biology of the Yeast Saccharomyces: 
Gene Expression, 1992, Cold Spring Harbor Press), therefore, CY588 cells should be able to 
secrete peptides encoded by oligonucleotides expressed from plasmid Cadus 1215. 

A high transforming version (tbtl-l) of CY588 was obtained by crossing CY1013 
(CY588 containing an episomal copy of the STE14 gene) (MATa barl;;hisGfarl-l fusl-HIS3 
stel4::TRPl ura3 trpl leu2 his3 [STE14 URA3 CEN4) to CY793 (MATol- tbtl-l ura3 leu2 
trpl his3 fusl-HIS2 canl stell4::TRPl [FUS1 LEU2 2u]) and selecting from the resultant 
spores a strain possessing the same salient genotype described for CY588 (see above), and in 
addition the tbl-1 allele, which confers the capacity for very high efficiency transformation 
by electroporation. The selected strain is CY1455 (MATabarl::hisGfarl-l Jusl-HIS3 
stel4::TRPl tbt-1 ura3 trpl leu2 his3). 

Secretion of peptides in the context of yeast ct-J actor: 

Experiments were performed to test: 1. the ability of Cadus 1215 to function as a 
vector for the expression of peptides encoded by synthetic oligonucleotides; 2. the suitability 
of the oligonucleotides, as designed, to direct the secretion of peptides through the a-factor 
secretory pathway; 3. the capacity of CY588 to secrete those peptides; and 4. the ability of 



CY588 to respond to those peptides that stimulate the pheromone response pathway by 
growing on selective media. These experiments were performed using an oligonucleotide 
which encodes the 13 amino acid a-factor; i.e., the degenerate sequence (NNN) n in the 
oligonucleotide cloned into Cadus 1215 (see above) was specified (n=13) to encode this 
pheromone. CY588 was transformed with the resulting plasmid (Cadus 1219), and 
transformants selected on uracil-deficient medium were transferred to histidine-deficient 
medium supplemented with a range of concentrations of aminotriazole (an inhibitor of the 
HIS3 gene product that serves to reduce background growth). The results demonstrate that 
the synthetic oligonucleotide, expressed in the context of MFal by Cadus 1215, conferred 
upon CY588 an ability to grow on histidine-deficient media supplemented with 
aminotriazole. In summation, these data indicate that: 1. CY588 is competent for the 
secretion of a peptide encoded by the (NNN) n sequence of the synthetic oligonucleotide 
cloned into and expressed from Cadus 1215; and 2. CY588 can, in an autocrine fashion, 
respond to a secreted peptide which stimulates its pheromone response pathway, in this case 
by a-factor binding to STE2. 

Additional experiments were performed to test the utility of autocrine yeast strains in 
identifying agonists of the Ste2 receptor from among members of two semi-random a-factor 
libraries, cc-Mid-5 and MFa-8. 

a- Mid- 5 Library 

A library of semi-random peptides, termed the a-Mid-5 library, was constructed. In 
this library, the N-terminal four amino acids and the C-terminal four amino acids of a 13 
residue peptide are identical to those of native a-factor while the central five residues 
(residues 5-9) are encoded by the degenerate sequence (NNQ) 5 . The following 
oligonucleotides were used in the construction of the a-Mid-5 library: 

(1) MFaMbo, a 17 men 

5' GCGTCACAGACTGATCA 

(2) MID5ALF, a 71 men 
5' 

GCCGTCAGTAAA<X?TTGGCATTGGTTGNNfQN 
CTGATC AGTCTGTGACGC 

Sequenase (United States Biochemical Corporation, Cleveland, Ohio) was used to 
complete the duplex formed after annealing MFaMbo to the MID5ALF oligonucleotide. In 
the MID5ALF sequence, N indicates a mixture of A, C, G, and T at ratios of 0.8:1:1.3:1; Q 
indicates a mixture of C and G at a ratio of 1 : 1 .3. These ratios were employed to compensate 
for the different coupling efficiences of the bases during oligonucleotide synthesis and were 
thus intended to normalize the appearance of all bases in the library. The double-stranded 
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oligonucleotide was restricted with Hindlll and Mbol and ligated to Cadus 1625 (see above); 
Cadus 1625 had been prepared to accept the semi-random oligonucleotides by restriction 
with Hindlll and Bglll. 

The apparent complexity of the ctMid-5 library is 1 x 10 7 . This complexity is based 
on the number of bacterial transformants obtained with the library DNA versus transformants 
obtained with control vector DNA that lacks insert. Sequence analysis of six clones from the 
library demonstrated that each contained a unique insert. 

To identify peptide members of the a-mid-5 library that could act as agonists on the 
STE2 receptor, CY1455, a high transforming version of CY588, was electroporated to 
enhance uptake of a-Mid-5 DNA. Transformants were selected on uracil-deficient (-Ura) 
synthetic complete medium and were transferred to histidine-deficient (-His) synthetic 
complete medium supplemented with 0.5mM or ImM aminotriazole. 

Yeast able to grow on -His + aminotriazole medium include (1) yeast which are 
dependent on the expression of an ct-factor variant agonist and (2) yeast which contain 
mutations that result in constitutive signalling along the pheromone pathway. Yeast 
expressing and secreting a variant STE2 receptor agonist have the ability to stimulate the 
growth on -His medium of surrounding CY 1455 cells that do not express such an agonist. 
Thus a recognizable formation (termed a "starburst") results, consisting of a central colony, 
growing by virtue of autocrine stimuation of the pheromone pathway, surrounded by satellite 
colonies, growing by virtue of paracrine stimulation of the pheromone pathway by the 
agonist peptide as that peptide diffuses radially from the central, secreting colony. 

In order to identify the peptide sequence responsible for this "starburst" phenomenon, 
yeast were transferred from the center of the "starburst" and streaks were made on -Ura 
medium to obtain single colonies. Individual clones from -Ura were tested for the His+ 
phenotype on -His + aminotriazole plates containing a sparse lawn of CY1455 cells. 
Autocrine yeast expressing a peptide agonist exhibited the "starburst" phenotype as the 
secreted agonist stimulated the growth of surrounding cells that lacked the peptide but were 
capable of responding to it. Constitutive pheromone pathway mutants were capable of 
growth on -His + aminotriazole but were incapable of enabling the growth of surrounding 
lawn cells. 

Alternatively, streaks of candidate autocrine yeast clones were made on plates 
containing 5-fluoroorotic acid (FOA) to obtain Ura segregants were retested on -His + 
aminotriazole for the loss of the His+ phenotype. Clones that lost the ability to grow on -His 
+ aminotriazole after selection on FOA (and loss of the peptide-encoding plasmid) derived 
from candidate expressors of a peptide agonist. The plasmid was rescued from candidate 
clones and the peptide sequences determined. In addition, a plasmid encoding a putative 
Ste2 agonist was reintroduced into CY1455 to confirm that the presence of the plasmid 
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encoding the peptide agonist conferred the His+ phenotype to CY1455. 

By following the above protocol novel Ste2 agonists have been identified from the oc-Mid-5 
library. Sequences of nine agonists follow, preceded by the sequence fo the native a-factor 
pheromone and by the oligonucleotide used to encode the native pheromone in these 
experiments. (Note the variant codons used in the a-factor-encoding oligonucleotide for 
glutamine and proline in the C-terminal amino acids of a-factor). 

a-factor TGG CAT TGG TTG CAG CTA AAA CCT GGC CAA CCA ATG TAC 
encodes Trp His Trp Leu Gin Leu Lys Pro Gly Gin Pro Met Tyr 


a-factor 



oligo: 
encodes 

TGG 
Trp 

CAT 
His 

TGG 
Trp 

TTG 
Leu 

CAG 

Gin 

CTA 
Leu 

AAA 

Lys 

CCT 
Pro 

GGC 
Gly 

CAG 
Gin 

CCT 
Pro 

ATG 
Met 

TAC 
Tyr 


Ml 

encodes 

TGG 
Trp 

CAT 
His 

TGG 
Trp 

TTG 
Leu 

TCC 

Ser 

TTG 

Leu 

TCG 

Ser 

CCC 
Pro 

GGG 
Gly 

CAG 
Gin 

CCT 
Pro 

ATG 
Met 

TAC 
Tyr 


M2 

encodes 

TGG 
Trp 

CAT 
His 

TGG 
Trp 

TTG 
Leu 

TCC 

Ser 

CTG 
Leu 

GAC 

Asp 

GCT 
Ala 

GGC 
Gly 

CAG 
Gin 

CCT 
Pro 

ATG 
Met 

TAC 
Tyr 


M3 

encodes 

TGG 
Trp 

CAT 
His 

TGG 
Trp 

TTG 
Leu 

ACC 

Xhr 

TTG 

Leu 

ATG 

Met 

GCC 
Ala 

GGG 
Gly 

CAG 
Gin 

CCT 
Pro 

ATG 
Met 

TAC 
Tyr 


M4 

encodes 

TGG 
Trp 

CAT 
His 

TGG 
Trp 

TTG 
Leu 

CAG 
Gin 

CTG 
Leu 

TCG 

Ser 

GCG 
Ala 

GGC 
Gly 

CAG 
Gin 

CCT 
Pro 

ATG 

Met , 

TAC 
Tyr 


MS 

encodes 

TGG 
Trp 

CAT 
His 

TGG 
Trp 

TTG 
Leu 

AGG 

Arg 

TTG 

Leu 

CAG 
Gin 

TCC 

Ser 

GGC 
Gly 

CAG 
Gin 

CCT 
Pro 

ATG 
Met 

TAC 
Tyr 


M6 

encodes 

TGG 
Trp 

CAT 
His 

TGG 
Trp 

TTG 
Leu 

CGC 
Arg 

TTG 

Leu 

TCC 
Gin 

GCC 
Ser 

GGG 
Gly 

CAG 
Gin 

CCT 
Pro 

ATG 
Met 

TAC 
Tyr 


M7 

encodes 

TGG 
Trp 

CAT 
His 

TGG 
Trp 

TTG 
Leu 

TCG 

Ser 

CTC 

Leu 

GTC 

Val 

CCG 
Pro 

GGG 
Gly 

CAG 
Gin 

CCT 
Pro 

ATG 
Met 

TAC 
Tyr 


M8 

encodes 

TGG 
Trp 

CAT 
His 

TGG 
Trp 

TTG 
Leu 

TCC 
Ser 

CTG 

Leu 

TAC 

m 

CCC 
Pro 

GGG 

Gly 

CAG 
Gin 

CCT 
Pro 

ATG 
Met 

TAC 
Tyr 


M9 

encodes 

TGG 
Trp 

CAT 
His 

TGG 
Trp 

TTG 
Leu 

CGG 

Arg 

CTG 

Leu 

CAG 

Gin 

CCC 
Pro 

GGG 
Gly 

CAG 
Gin 

CCT 
Pro 

ATG 
Met 

TAC 
Tyr 


The nine peptide agonists of the Ste2 receptor above were derived from one 
electroporation of CY1455 using 1 jig of the a-Mid-5 library DNA. Approximately 3 x 10 5 
transformants were obtained, representing approximately .03% of the sequences present in 
that library. 


MFa-8 Library 

A semi-random a-factor library was obtained through synthesis of mutagenized ce- 
faclor oligonucleotides such that 1 in 10,000 peptide products were expected to be genuine a 
-factor. The mutagenesis was accomplished with doped synthesis of the oligonucleotides: 
each nucleotide was made approximately 68% accurate by synthesizing the following two 
oligos: 

5' CTGGAIQCGAAGACTCAGCT (20 mer) (oligo060) 

5' CGGATGATCA gta cat tgg ttg gec agg ttt tag ctg caa cca atg cca AGC TGA GTC 
TTC GCA TCC AG (69 mer) (oligo074) 

The lower case letters indicate a mixture of 67% of that nucleotide and 1 1% of each of the 
other three nucleotides (e.g. t indicates 67% T and 1 1% A, 1 1% C, and 1 1% G). Note that 
digestion of the double-stranded oligo-nucleotide by Fokl or Bbsl will yield an identical 5* 
end that is compatible with Hindlll ends. 

Oligos 060 and 074 will form the following double-stranded molecule when annealed: 
5'-CTQGAlGCGAAGACTCAGCT 

3 '-GACCTACGCTTCTGAGTCGA acc gta acc aac gtc gat ttt gga ccg gtt ggt tac atg 
ACTAGTAGGC-5' 

The duplex was repetitively filled-in using Taq DNA polymerase (Promega 
Corporation, Madison, Wisconsin). The double-stranded product was restricted with Bbsl 
and Bell and ligated into Hindlll- and Bglll-digested Cadus 1373. The Bglll/Bcll joint 
creates a TGA stop codon for the termination of translation of the randomers. Using this 
approach, the MFa-5.8 library (a library of apparent low complexity based on PCR analysis 
of oligonucleotide insert frequency) was constructed. 

To identify peptide members of the MFa-5.8 library that could act as agonists on the 
STE2 receptor, CY1455, a high transforming version of CY588, was electroporated to 
enhance uptake of MFa-5.8 DNA. Transformants were selected on uracil-deficient (-Ura) 
synthetic complete medium and were transferred to histidine-deficient (-His) synthetic 
complete medium supplemented with 1.0 mM or 2.5 mM aminotriazole. Yeast from 
colonies which were surrounded by satellite growth were transferred as streaks to -Ura 
medium to obtain single colonies. Yeast from single colonies wree then tested for the His+ 
phenotype on -His + aminotriazole plates. Sequence analysis of seven of the plasmids 



rescued from His+ yeast revealed three unique a-factor variants that acted as agonists on the 
STE2 receptor. 

1.4 independent clones had the following sequence: 

TGG CAT TGG CTA CAG CTA ACG CCT GGG CAA CCA ATG TAC 
encoding Trp His Trp Leu Gin Leu Thr Pro Gly Gin Pro Met Tyr 


2.2 independent clones had the following sequence: 

TGG CAT TGG CTG GAG CTT ATG CCT GGC CAA CCA TTA TAC 
encoding Trp His Trp Leu Qlu Leu Met Pro Gly Gin Pro Leu Tyr 

3 - TGG CAT TGG ATG GAG CTA AGA CCT GGC CAA CCA ATG TAC 

encoding Trp His Trp Met filii Leu Arg Pro Gly Gin Pro Met Tyr 

Autocrine Mata strain CY599: 

A MATa strain designed for the expression of peptides in the context of MFA1 (i.e., 
using the MFA1 expression vector, Cadus 1239) has been constructed. The genotype of this 
strain, designated CY599, is MATa mfal mfa2 far 1-1 his3::fUsl-HIS3 ste2-STE3 ura3 
metl adel leu2. In this strain, Cadus 1 147 (see above) was used to replace STE2 with a 
hybrid gene in which the STE3 coding region is under the control of expression elements 
which normally drive the expression of STE2. As a result, the a-factor receptor replaces the 
a-factor receptor. The genes which encode a-factor are deleted from this strain; the farl 
mutation abrogates the arrest of growth which normally follows stimulation of the 
pheromone response pathway; and the FUS1-HIS3 hybrid gene (integrated at the HIS3 locus) 
provides a selectable signal of activation of the pheromone response pathway. CY599 cells 
were expected to be capable of the transport of a-factor or a-factor-like peptides encoded by 
oligonucleotides expressed from Cadus 1239 by virtue of expression of the endogenous yeast 
transporter, Ste6. 

Transport of peptides by the yeast a-factor pathway: 

Experiments were performed to test: 1. the ability of Cadus 1239 to function as a 
vector for the expression of peptides encoded by synthetic oligonucleotides; 2. the suitability 
of the oligonucleotides, as designed, to direct the export of farnesylated, carboxymethylated 
peptides through the pathway normally used by a-factor; 3. the capacity of CY599 to export 
these peptides; and 4. the ability of CY599 to respond to those peptides that stimulate the 
pheromone response pathway by growing on selective media. These tests were performed 
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using an oligonucleotide which encodes the 12 amino acid a-factor; specifically, the 
degenerate sequence (NNN) n in the oligo-nucleotide cloned into Cadus 1239 (see above) 
(with n=12) encodes the peptide component of a-factor pheromone. CY599 was transformed 
with the resulting plasmid (Cadus 1220), and transformants selected on uracil-deficient 
medium were transferred to histidine-deficient medium supplemented with a range of 
concentrations of aminotriazole. The results demonstrate that the synthetic oligonucleotide, 
expressed in the context of MFA1 by Cadus 1220, conferred upon CY599 enhanced 
aminotriazole-resistant growth on histidine-deficient media. In summation, these data 
indicate: 1. Cadus 1220 and the designed oligonucleotide are competent to direct the 
expression and export of a farnesylated, carboxymethylated peptide encoded by the (NNN) n 
sequence of the synthetic oligonucleotide; and 2. CY599 can, in an autocrine fashicn, 
respond to a farnesylated, carboxymethylated peptide that stimulates its pheromone response 
pathway, in this case signaling initiates as a-factor binds to STE3. 

Example 5. Proof of Concept 

This example will demonstrate the utility of the autocrine system for the discovery of 
peptides which behave as functional pheromone analogues. By analogy, this system can be 
used to discover peptides that productively interact with any pheromone receptor surrogates. 

CY588 (see strain 5, Example 2 above) will be transformed with CADUS 1215 
containing oligonucleotides encoding random tridecapeptides for the isolation of functional a 
-factor analogues. C YS99 (see strain 6, Example 2 above) will be transformed with CADUS 
1239 containing oligos of random sequence for the isolation of functional a-factor analogues. 
Colonies of either strain which can grow on histidine-deficient media following 
transformation will be expanded for the preparation of plasmid DNA, and the oligo- 
nucleotide cloned into the expression plasmid will be sequenced to determine the amino acid 
sequence of the peptide which presumably activates the pheromone receptor. This plasmid 
will then be transfected into an isogenic strain to confirm its ability to encode a peptide 
which activates the pheromone receptor. Successful completion of these experiments will 
demonstrate the potential of the system for the discovery of peptides which can activate 
membrane receptors coupled to the pheromone response pathway. 

Random oligonucleotides to be expressed by the expression plasmid CADUS 1215 
will encode tridecapeptides constructed as 

5' CGTGAAGCITAAGCGTGAGGC AG AAGCTHSJNK't 1 3 TGATC A TCCCV where N is any 
nucleotide, K is either T or G at a ratio of 40:60 (see Proc. Natl. Acad. Sci. 87:6378, 1990; 
ibid 89:5393, 1992), and the Aflll and Bell sites are underlined. This oligonucleotide is 
designed such that: the Aflll and Bell sites permit inserting the oligos into the Aflll and Bglll 
site of CADUS 1215, the Hindlll site just 5* to the Aflll site in the 5* end of the oligo allows 
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future flexibility with cloning of the oligos; the virtual repeat of GAGGCT and the GAGA 
repeats which are present in the wild-type sequence and which can form triple helixes are 
changed without altering the encoded amino acids. The random oligonucleotides described 
above will actually be constructed from the following two oligos: 

5* CGTGAAGCTTAAGCGTGAGGCAGAAGCT and 

5' CGGATGATCA(MNN) 13 AGCTTCTG , 

where M is either A or C at a ratio of 40:60. The oligos will be annealed with one another 
and repetitively filled in, denatured, and reannealed (Kay et al, Gene, 1993). The double- 
stranded product will be cut with Aflll and Bell and ligated into the Aflll- and Bglll-digested 
CADUS 1215. The Bglll/Bcll joint will create a TGA stop codon for termination of 
translation of the randomers. Because of the TA content of the Afl overhang, the oligos will 
be ligated to the Aflll-and Bglll-digested pADC-MFa at 4° C. 

Random oligonucleotides to be expressed by the expression plasmid CADUS 1239 
will encode monodecapeptides constructed as 

5* GGTA CTCG A Q TGAAAAGAAGGAC AAC(NNK) 1 , TGTGTTATT GCTTA AP T A CG : 

where N is any nucleotide, K is either T or G at a ratio of 40:60 (see Proc. Natl. Acad, set 
87:6378, 1990; ibid 89:5393, 1992). When cloned into the Xhol and Aflll sites of CADUS 
1239 the propeptides expressed under the control of the ADH1 promoter will contain the 
entire leader peptide of MFal, followed by 1 1 random amino acids, followed by triplets 
encoding CVIA (the C-terminal tetrapeptide of wild-type a-factor). Processing of the 
propeptide should result in the secretion of dodecapeptides which contain 11 random amino 
acids followed by a C-terminal, farnesylated, carboxymethylated cysteine. 

Using the procedure described above, the oligonucleotides for expression in CADUS 
1239 will actually be constructed from the following two oligos: 

5' GGTACTCGAGTGAAAAGAAGGACAAC and 

5' CGTACTTAAGCAATAACAca(MNN) u GTTGTCC, 

where M is either A or C at a ratio of 40:60, and the Xhol and Aflll sites are 
underlined. 

Discovery of a-factor analoques from a random peptide library 

An optimized version of strain 6 (Example 2 above) was derived. This yeast strain, 
CY2012 (MATa ste2-STE3 far J A1442 mfal::LEU2 mfa2-lacZ fusl-HIS3 tbtl-1 ura3 leu2 his3 
trpl suc2), was constructed as follows. From a cross of CY570 (MATa mfal::LEU2 mfa2- 
lacZura3 trpl his3A200 canl leu2 Jusl-HIS3 [MFA1 URA3 2u] [FuslA8-73 TRP1 CEN6]) by 
CY1624 (MATa tbtl-1 fusl-HIS3 trpl ura3 leu2 his3 lys2-801 SUC+), a spore was selected 
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(CY1877) of the following genotype: MATa mfal::LEU2 mfa2-lacZ fitsl-HIS3 tbtl-1 ura3 
leu2 his3 trpl suc2. This strain lacks both genes (NFA1 and MFA2) encoding a-factor 
precursors, contains the appropriate pheromone pathway reporter gene (fusl-HlS3) t and 
transforms by electroporation at high efficiency (tbtl-l). This strain was altered by deletion 
of the FAR1 gene (with Cadus 1442; see Example 6), and replacement of STE2 coding 
sequences with that of STE3 (see Example 1) to yield CY2012. 

This strain was transformed with plasmid DNA from a random a-factor library by 
electroporation and plated on 17 synthetic complete plates lacking uracil (-Ura), yielding 
approximately 10 5 Ura+colonies per plate after 2 days at 30°C. These colonies were replica 
plated to histidine-deficient synthetic complete media (-His) containing 0.2 mM 3- 
aminotriazole and after three days at 30°C 35 His+ replicas were streaked to -Ura plates. The 
resultant colonies, 3 from each isolate, were retested for their His+ phenotype, and streaked 
to 5-fluoroorotic acid plates to obtain Ura segregants (lacking a library plasmid). Those Ura- 
segregants were tested for the loss of their His+ phenotype. Ten of the original isolates 
passed these tests; in two cases only one of the three Ura+ colonies purified from the isolate 
retained the His+ phenotype, but nevertheless subsequently segregated Ura His- colonies. 

A single plasmid (corresponding to a bacterial colony) was obtained from each of the 
ten isolates, and reintroduced into CY2012. Eight of the ten plasmids passed the test of 
retaining the ability to confer the His+ phenotype on CY2012 (the two that failed correspond 
to the two isolates that were mentioned above, suggesting that these isolates contain at least 
one "irrelevant' plasmid). Sequencing of the randomized insert in the eight plasmids of 
interest revealed that four contain the sequence: 

TAT GCT CTG TTT GTT CAT TTT TTT GAT ATT CCG 

Tyr Ala Leu Phe Val His Phe Phe Asp lie Pro 

two contain the sequence: 
TTT AAG GGT CAG GTG CGT TTT GTG GTT CTT GCT 
Phe Lys Gly Gin Val Arg Phe Val Val Leu Ala, 

and two contain the sequence: 
CTT ATG TCT CCG TCT TTT TTT TTT TTG CCT GCG 
Leu Met Ser Pro Ser Phe Phe Phe Leu Pro Ala 

Clearly, these sequences encode novel peptides, as the native a-factor sequence differs 
considerably: 

Tyr lie He Lys Gly Val Phe Trp Asp Pro Ala. 

The a-factor variants identified from random peptide libraries have utility as 
"improved" substrates of ABC transporters expressed in yeast. For example, identification of 
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a preferred substrate of human MDR, one that retains agonist activity on the pheromone 
receptor, would permit the establishment of robust yeast screens to be used in the discovery 
of compounds that affect transporter function. 

Example 6: Functional Expression of a Mammalian G Protein-Coupled Receptor and 
Ligand in an Autocrine Yeast Strain. 

This example details the following: (1) expression of human C5a receptor in yeast; 
(2) expression of the native ligand of this receptor, human C5a, in yeast; and (3) activation of 
the endogenous yeast pheromone pathway upon stimulation of the C5a receptor by C5a when 
both of these molecules are expressed within the same strain of autocrine yeast. Following 
the experimental data we outline the utility of autocrine strains of yeast that functionally 
express the human C5a receptor. 

Human C5a is a 74 amino acid polypeptide that derives from the fifth component of 
complement during activation of the complement cascade; it is the most potent of the 
complement-derived anaphylatoxins. C5a is a powerful activator of neutrophils and 
macrophage functions including production of cytotoxic super oxide radicals and induction 
of chemotaxis and adhesiveness. In addition C5a stimulates smooth muscle contraction, 
induces degranulation of mast cells, induces serotonin release from platelets and increases 
vascular permeability. The C5a anaphylatoxin can also amplify the inflammatory response 
by stimulating the production of cytokines. As C5a is a highly potent inflammatory agent, it 
is a primary target for the development of antagonists to be used for intervention in a variety 
of inflammatory processes. 

The C5a receptor is present on neutrophils, mast cells, macrophages and smooth 
muscle cells and couples through G proteins to transmit signals initiated through the binding 
ofC5a. 

Expression of the C5a Receptor 

The plasmid pCDM8-C5aRc, bearing cDNA sequence encoding the human C5a 
receptor, was obtained from N. Gerard and C. Gerard (Harvard Medical School, Boston, MA) 
(Gerard and Gerard 1991). Sequence encoding C5a was derived from this plasmid by PCR 
using VENT polymerase (New England Biolabs Inc., Beverly MA), and the following 
primers: 

#i-ggt;gggagggtgctcxctagaaggaagtgttcacc 
#2-gcccaggagaccaga£calgfiactccttcaattataccacc 


Primer #1 contains a single base-pair mismatch (underlined) to C5a receptor cDNA. It 


introduces an Xbal site (in bold) 201 bp downstream from the TAG termination codon of the 
C5a receptor coding sequence. Primer #2 contains two mismatched bases and serves to 
create an Ncol site (in bold) surrounding the ATG initiator codon (double underlined). The 
second amino acid is changed from an aspartic acid to an asparagine residue. This is the only 
change in primary amino acid sequence from the wild type human C5a receptor. 

The PCR product was restricted with Ncol and Xbal (sites in bold) and cloned into 
CADUS 1002 (YEpSlNco), a Gal 10 promoter expression vector. The sequence of the entire 
insert was determined by dideoxy sequencing using multiple primers. The sequence between 
the Ncol and Xbal sites was found to be identical to the human C5a receptor sequence that 
was deposited in GenBank (accession #J05327) with the exception of those changes encoded 
by the PCR primers. The C5a receptor-encoding insert was transferred to CADUS 1289 
(pLPXt), a PGK promoter expression vector, using the Ncol and Xbal sites, to generate the 
C5a receptor yeast expression clone, CADUS 1303. 

A version of the C5a receptor which contains a yeast invertase signal sequence and a 
myc epitope tag at its amino terminus was expressed in Cadus 1270-transferred yeast under 
control of a GAL 10 promoter. Plasmids encoding an untagged version of the C5a receptor 
and a myc-tagged derivative of FUS1 served as controls. The expression of the tagged 
receptor in yeast was confirmed by Western blot using the anti-myc monoclonal antibody 
9E10. In the lane containing the extract from the Cadus 1270-transformant, the protein that 
is reactive with the anti-myc monoclonal antibody 9E10 was approximately 40 kD in size, as 
expected. Note that this receptor construct is not identical to the one used in the autocrine 
activation experiments. That receptor is not tagged, does not contain a signal sequence and is 
driven by the PGK promoter. 

Expression of the Ligand, C5a 

A synthetic construct of the sequence encoding C5a was obtained from C. Gerard 
(Harvard Medical School, Boston, MA). This synthetic gene had been designed as a FLAG- 
tagged molecule for the secretion from E. coli (Gerard and Gerard (1990) Biochemistry 
29:9274-9281). The C5a coding region, still containing E. coli codon bias, was amplified 
using VENT polymerase (New England Biolabs Inc., Beverly MA) through 30 cycles using 
the following primers: 

C5a5' = CCCCTTAAGCGTGAGGCAGAAGCTACTCTGCAAAAGAAGATC 
C5a3 ' = GAAGATCTTC AGCGGCCGAGTTGCATGTC 

A PCR product of 257 bp was gel isolated, restricted with Aflll and Bglll, and cloned 
into CADUS 1215 (an expression vector designed to express peptide sequences in the context 
of Mfa) to yield CADUS 1297. The regions of homology to the synthetic C5a gene are 
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underlined. The 5' primer also contains pre-pro a-factor sequence. Upon translation and 
processing of the pre-pro a-factor sequence, authentic human C5a should be secreted by 
yeast containing CADUS 1297. The insert sequence in CADUS 1297 was sequenced in both 
orientations by the dideoxy method and found to be identical to that predicted by the PCR 
primers and the published sequence of the synthetic C5a gene (Franke et al. (1988) Methods 
in Enzymology 162: 653-668). 

Two sets of experiments, aside from the autocrine activation of yeast detailed below, 
demonstrated that CADUS 1297 can be used to express C5a in yeast. 

1) . C5a was immunologically detected in both culture supernatant and lysed cells 
using a commercially available enzyme-linked immunosorbent assay (ELISA)(Table 1). 
This assay indicated the concentration of C5a in the culture supernatant to be approximately 
50 to 100 nM. In comparison, in data derived from mammalian cells, the binding constant of 
C5a to its receptor is 1 nM (Boulay et al.(1991) Biochemistry 30:2993-2999. 

2) . C5a expressed in yeast was shown to compete for binding with commercially 
obtained (Amersham Corporation, Arlington Heights, IL), radiolabeled C5a on induced 
HL60 cells. 

Activation of the Pheromone Response Pathway in Autocrine Yeast Expressing the Human 
C5a Receptor and Human C5a 

Activation of the yeast pheromone response pathway through the interaction of C5a 
with the C5a receptor was demonstrated using a growth read-out. The strain used for this 
analysis, CY455 (MATa tbtl-1 ura3 leu2 trpl his3 fusl-HIS3 canl stel4::TRPl ste3*1156) 
contains the following significant modifications. A pheromone inducible HIS 3 gene , fusl- 
HIS3, is integrated at the Fusl locus. A hybrid gene containing sequence encoding the first 
41 amino acids of GPA1 (the yeast Get subunit) fused to sequence encoding human Gai2a 
(minus codons for the N-terminal 33 amino acids) replaces GPA1 at its normal chromosomal 
location. The yeast STE14 gene is disrupted to lower the basal level of signaling through the 
pheromone response pathway. The yeast a-factor receptor gene, STE3, is deleted. The last 
two modifications are probably not essential, but appear to improve the signal-to-noise ratio. 

CY455 (MATa tbtl-1 ura3 leu2 trpl his3 fusl-HIS3 canl stel4::TRPl ste3*1156) 
was transformed with the following plasmids: 

Cadus 1289 + Cadus 1215 = Receptor Ligand"= (R-L-) 
Cadus 1303 + Cadus 1215 = Receptor + Ligand" = R+L- 
Cadus 1289 + Cadus 1297 = Receptor Ligand + = (R-L+) 
Cadus 1303 + Cadus 1297 = Receptor" 4 " Ligand += (R+L+) 
Receptor refers to the human C5a receptor. 
Ligand refers to human C5a. 



Three colonies were picked from each transformation and grown overnight in media 
lacking leucine and uracil, at pH 6.8 with 25 mM PIPES (LEU URA pH6.8 with 25 mM 
PIPES). This media was made by adding 0.45 ml of sterile 1M KOH and 2.5 ml of sterile 
1M PIPES pH 6.8 to 100 ml of standard SD LEU- URA- media. After overnight growth the 
pH of this media is usually acidified to approximately pH 5.5. Overnight cultures were 
washed once with 25 mM PIPES pH 6.8 and resuspended in an equal volume of media 
lacking leucine, uracil and histidine (LEU URA HIS pH 6.8 with 25 mM PIPES). The 
optical density at 600 nm of a 1/20 dilution of these cultures was determined and the cultures 
were diluted into 25 mM PIPES pH 6.8 to a final OD600 of °- 2 * A volume (5ul) of this 
dilution equivalent to 10,000 cells was spotted onto selective (HIS+ TRP- pH6.8) plates. 
Only those strains expressing both C5a and its receptor (R+L+) show growth on the selective 
plates which lack histidine. All test strains are capable of growth on plates containing 
histidine. The R+L+ strain will grow on plates containing up to 5 mM aminotriazole, the 
highest concentration tested. 

For verification of pheromone pathway activation and quantification of the 
stimulation, the activity of the fusl promoter was determined colorometrically using a fusl- 
lacZ fusion in a similar set of strains. CY878 (MATcc tbtl-1 fusl-HIS3 caNl 
stel4::trpl::LYS2 ste3*1156 gpal(41)-Gai2) was used as the starting strain for these 
experiments. This strain is a trpl derivative of CY455. The transformants for this 
experiment contained CADUS 1584 (pRS424-fusl-lacZ) in addition to the receptor and 
ligand plasmids. Four strains were grown overnight in SD LEU URA TRP pH6.8 with 
50mM PIPES to an OD 600 of less than 0.8. Assay of P-galactosidase activity (Guarente 
1983) in these strains yields the data shown in Figure 4. The coupling of the C5a receptor to 
Got chimeras is shown in Table 2. 

Projected Uses of the Autocrine C5a Strains: 

A primary use of the autocrine C5a strains will be in the discovery of C5a 
antagonists. Inhibitors of the biological function of C5a would be expected to protect against 
tissue damage resulting from inflammation in a wide variety of inflammatory disease 
processes including but not limited to: respiratory distress syndrome (Duchateau et al. (1984) 
Am Rev Respir Dis 130:1058); (Hammerschmidt et al. (1980) Lancet 1:947), septic lung 
injury (Olson et al. 1985) Ann Surg 202:771), arthritis (Banerjee et al. (1989) J. Immuinol 
142:2237), ischemic and post-ischemic myocardial injury (Weisman (1990) Science 
146:2490; (Crawford et al. (1988) Circulation 78:1449) and burn injury (Gelfand et al. (1982) 
J. Clin Invest 70:1170). 

The autocrine C5a system as described can be used to isolate C5a antagonists as 
follows: 
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1. High throughput screens to identify antagonists of C5a. 

A straightforward approach involves screening compounds to identify those which 
inhibit growth of the R+L+ strain described above in selective media but which do not inhibit 
the growth of the same strain or of a R+L- strain in non-selective media. The counterscreen 
is necessary to eliminate from consideration those compounds which are generally toxic to 
yeast. Initial experiments of this type have led to the identification of compounds with 
potential therapeutic utility. 

2. Identification of antagonists usi ng negative selection. 

Replacement of the fusl-HIS3 read-out with one of several negative selection 
schemes (fusl-URA3/FOA, fusl-GALl /galactose or deoxygalactose, Farl sst2 or other 
mutations that render yeast supersensitive for growth arrest) would generate a test system in 
which the presence of an antagonist would result in the growth of the assay strain. Such an 
approach would be applicable to high-throughput screening of compounds as well as to the 
selection of antagonists from random peptide libraries expressed in autocrine yeast. 
Optimization of screens of this type would involve screening the R+L+ strain at a 
concentration of aminotriazole which ablates growth of the R+L- strain (we are currently 
using 0.6 to 0.8 mM) and counterscreening the R+L- strain at a concentration of 
aminotriazole which gives an identical growth rate (we are using 0.14 mM). In addition, the 
system could employ one of several colorometric, fluorescent or chemiluminescent readouts. 
Some of the genes which can be fused to the fusl promoter for these alternate read-outs 
include lacZ (colorometric and fluorescent substrates), glucuronidase 20 (colorometric and 
fluorescent substrates), phosphatases (e.g. PH03, PH05, alkaline phosphatase; colorometric 
and chemiuminescent substrates), green protein (endogenous fluorescence), horse radish 
peroxidase (colorometric), luciferase (chemiluminescence). 

The autocrine C5a strains have further utility as follows: 

3- In the i dentification of novel C5a agonists from random peptide libraries expressed in 
autocrine yeast. 

Novel peptide agonists would contribute to structure/function analyses used to guide 
the rational design of C5a antagonists. 

4. In the identification of receptor mutants. 

Constitutively active, that is, ligand independent, receptors may be selected from 
highly mutagenized populations by growth on selective media. These constitutively active 
receptors may have utility in permitting the mapping of the sites of interaction between the 
receptor and the G-protein. Identification of those sites may be important to the rational 


design of drugs to block that interaction. In addition, receptors could be selected for an 
ability to be stimulated by some agonists but not others or to be resistant to antagonist. 
These variant receptors would aid in mapping sites of interaction between receptor and 
agonist or antagonist and would therefore contribute to rational drug design efforts. 

5. In the identification of molecules that interact with Gm2. 

Compounds or peptides which directly inhibit GDP exchange from Gai2 would have 
the same effect as C5a antagonists in these assays. Additional information would distinguish 
inhibitors of GDP exchange from C5a antagonists. This information could be obtained 
through assays that determine the following: 

1 . inhibition by test compounds of Gai2 activation from other receptors, 

2. failure of test compounds to compete with radiolabeled C5a for binding to the C5a 
receptor, 

3. failure of test compounds to inhibit the activation of other Got subunits by C5a, 

and 

4. inhibition by test compounds of signalling from constitutively active versions of 
C5a, or other, receptors. 

Example 7: Construction of Xybrid Ga Genes Construction of two sets of chimeric 
yeast/mammalian Ga genes, GPA 41 -Ga and GPAl Bam -Ga. 

The Ga subunit of heterotrimeric G proteins must interact with both the py complex and the 
receptor. Since the domains of Ga required for each of these interactions have not been 
completely defined and since our final goal requires Ga proteins that communicate with a 
mammalian receptor on one hand and the yeast py subunits on the other, we desired to derive 
human-yeast chimeric Ga proteins with an optimized ability to perform both functions. 
From the studies reported here we determined that inclusion of only a small portion of the 
amino terminus of yeast Ga is required to couple a mammalian Ga protein to the yeast Py 
subunits. It was anticipated that a further benefit to using these limited chimeras was the 
preservation of the entire mammalian domain of the Got protein believed to be involved in 
receptor contact and interaction. Thus the likelihood that these chimeras would retain their 
ability to interact functionally with a mammalian receptor expressed in the same yeast cell 
was expected to be quite high. 

Plasmid constructions. 

pRS416-GiM7 (Cadus 1069). An Xbal - Sad fragment encoding the entire GPA1 
promotor region, coding region and approximately 250 nucleotides of 3' untranslated region 
was excised from 10 YCplaclll-GPAl (from S. Reed, Scripps Institute) and cloned into 


YEp vector pRS416 (Sikorski and Hieter, Genetics 122: 19 (1989)) cut with Xbal and SacL 


Site-directed mutagenesis f GPA 1 (Cadus 1075, 1121 and 1122). A1.9kbEcoRI 
fragment containing the entire GPA1 coding region and 200 nucleotides from the 5' 
untranslated region was cloned into EcoRI cut, phosphatase-treated p ALTER- 1 (Promega) 
and transformed by electroporation (Biorad Gene Pulser) into DH5ctF f bacteria to yield 
Cadus 1075. Recombinant phagemids were rescued with M13K07 helper phage and single 
stranded recombinant DNA was extracted and purified according to the manufacturer's 
specifications. A new Ncol site was introduced at the initiator methionine of GPA1 by 
oligonucleotide directed mutagenesis using the synthetic oligonucleotide: 

5' GATATATTAAGGTAGGAA ACCATGG GGTGTACAGTGAG 3'. 

Positive clones were selected in ampicillin and several independent clones were sequenced in 
both directions across the new Ncol site at +1. Two clones containing the correct sequences 
were retained as Cadus 1121 and 1 122. 

Construction of a GPAl-based expression vector (Cadus 1127). The vector used 
for expression of full length and hybrid mammalian Got proteins in yeast, Cadus 1 127, was 
constructed in the following manner. A 350 nucleotide fragment spanning the 3' untranslated 
region of GPA1 was amplified with Taq polymerase (AmpliTaq; Perkin Elmer) using the 
oligonucleotide primers A (5' CGAGGCTCGAGGGAACGTATAATTAAAGTAGTG 3') 
and B (5* GCGCGGTACCAAGCTTCAATTCGAGATAATACCC 3'). The 350 nucleotide 
product was purified by gel electrophoresis using GeneClean II (Biol 01) and was cloned 
directly into the pCRII vector by single nucleotide overlap TA cloning (InVitrogen). 
Recombinant clones were characterized by restriction enzyme mapping and by 
dideoxynucleotide sequencing. Recombinant clones contained a novel Xhol site 5* to the 
authentic GPA1 sequence and a novel Kpnl site 3' to the authentic GPA1 sequence donated 
respectively by primer A and primer B. 

The NotI and SacI sites in the polylinker of Cadus 1013 (pRS414) were removed by 
restriction with these enzymes followed by filling in with the Klenow fragment of DNA 
polymerase I and blunt end ligation to yield Cadus 1092. The 1.4 kb PstI - EcoRI 5' 
fragment of GPA1 from YCplaclll- GPA J containing the GPA1 promoter and 5' 
untranslated region of GPA I was purified by gel electrophoresis using GeneClean (BiolOl) 
and cloned into PstI - EcoRI restricted Cadus 1013 to yield Cadus 1087. The PCR amplified 
Xhol - Kpnl fragment encoding the 3' untranslated region of GPA1 was excised from Cadus 
1089 and cloned into Xhol- Kpnl restricted Cadus 1087 to yield Cadus 1092. The Notl 


and Sacl sites in the poly linker of Cadus 1092 were removed by restriction with these 

enzymes, filling in with the Klenow fragment of DNA polymerase I, and blunt end ligation to 

yield Cadus 1110. The region of Cadus 1 122 encoding the region of GPA1 from the EcoRI 

site at -200 to +120 was amplified with Vent DNA polymerase (New England Biolabs, 

Beverly, MA) with the primers 

5' CCCGAATCCACCAATTTCTTTACG 3*and 

5' GCGGCGTCGACGCGGCCGCGTAACAGT 3*. 

The amplified product, bearing an EcoRI site at its 5' end and novel Sacl, NotI and 
Sail sites at its 3 1 end was restricted with EcoRI and Sail, gel purified using GeneClean II 
(BiolOl), and cloned into EcoRI and Sail restricted Cadus 1110 to yield Cadus 1127. The 
DNA sequence of the vector between the EcoRI site at -200 and the Kpnl site at the 3' end of 
the 3' untranslated region was verified by restriction enzyme mapping and dideoxynucleotide 
DNA sequence analysis. 

PCR amplification of GPA 41 -Ga proteins and cloning into Cadus 1127. cDNA 
clones encoding the human G alpha subunits Gas, Gai2, Gai3, and S. cerevisiae GPA1 were 
amplified with Vent thermostable polymerase (New England Bioloabs, Beverly, MA). The 
primer pairs used in the amplification are as follows: 

GaS Primer 1 : 5'CTGCTGGAGCTCCGCCTGCTGCTGCTGGGTGCTGGAG3* (Sacl 5') 

Primer 2 : 5'CTGCTGGTCGACGCGGCCGCGGGGGTTCCTTCTTAGAAGCAGC3* 
(Sail 3') 

Primer 3 : 5'GGGCTCGAGCCTTCTTAGAGCAGCTCGTAC3' (Xhol 3') 

Gai2 Primer 1 : 5 'CTGCTGG AGCTC AAGTTGCTGCTGTTGGGTGCTGGGG3 1 (SacI5') 
Primer 2 : 5'CTGCTGGTCGACGCGGCCGCGCCCCTCAGAAGAGGCCGCGGT 
CC3' (Sail 3*) 

Primer 3 : 5 'GGGCTCGAGCCTC AG AAGAGGCCGC AGTC3 ' (Xhol 3') 

Gai3 Primer 1: 5 'CTGCTGGAGCTC AAGCTGCTGCTACTCGGTGCTGGAG3 ' (SacI5') 
Primer 2 : S'CTGCTGGTCGACGCGGCCGCCACTAACATCCATGCTTCTCAAT 

AAAGTC3* (Sail 3*) 

Primer 3 : 5'GGGCTCGAGCATGCTTCTCAATAAAGTCCAC3' (Xhol 3') 

After amplification, products were purified by gel electrophoresis using GeneClean II 
(BiolOl^) and were cleaved with the appropriate restriction enzymes for cloning into Cadus 
1127. 

The hybrid GPA 41 -G a subunits were cloned via a Sacl site introduced at the desired 
position near the 5* end of the amplified genes and a Sail or Xhol site introduced in the 3' 
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untranslated region. Ligation mixtures were electroporated into competent bacteria and 
plasmid DNA was prepared from 50 cultures of ampicillin resistant bacteria. 

Construed n of Integrating Vectors Enc ding GPA 4 i-G a Subunits. The coding region of 
each GPA 41 -G a hybrid was cloned into an integrating vector (pRS406 = URA3 AmpR) using 
the BssHII sites flanking the polylinker cloning sites in this plasmid. Cadus 1011 (pRS406) 
was restricted with BssHII, treated with shrimp alkaline phosphatase as per the 
manufacturer's specifications, and the linearized vector was purified by gel electrophoresis. 
Inserts from each of the GPA 41 -G a hybrids were excised with BssHII from the parental 
plasmid, and subcloned into gel purified Cadus 1011. 

Construction of GPA BAM -Gct Constructs. A novel BamHI site was introduced in 
frame into the GPA1 coding region by PCR amplification using Cadus 1179 (encoding a 
wildtype GPA1 allele with a novel Ncol site at the initiator methionine) as the template, 
VENT polymerase, and the following primers: Primer A = 5' 
GCATCCATCAATAATCCAG 3' and Primer B = 5* GAAACAATGGA - 
TCCACTTCTTAC 3'. The 1.1 kb PCR product was gel purified with GeneClean II 
(Biol 01), restricted with Ncol and BamHI and cloned into NcoI-BamHI cut and 
phosphatased Cadus 1 122 to yield Cadus 1605. The sequence of Cadus 1605 was verified by 
restriction analysis and dideoxy-sequencing of double-stranded templates. Recombinant 
GPA Bam -Ga hybrids of Gas, Gai2, and Gal 6 were generated. Construction of Cadus 1855 
encoding recombinant GPA Bam -Ga 16 serves as a master example: construction of the other 
hybrids followed an analogous cloning strategy. The parental plasmid Cadus 1617, encoding 
native Gal 6, was restricted with Ncol and BamHI, treated with shrimp alkaline phosphatase 
as per the manufacturer's specifications and the linearized vector was purified by gel 
electrophoresis. Cadus 1605 was restricted with Ncol and BamHI and the 1.1 kb fragment 
encoding the amino terminal 60% of GPA1 with a novel BamHI site at the 3 1 end was cloned 
into the Ncol- and BamHI-restricted Cadus 1617. The resulting plasmid encoding the 
GPA Bam -Ga 16 hybrid was verified by restriction analysis and assayed in tester strains ror an 
ability to couple to yeast Gpy and thereby suppress the gpal null phenotype. Two additional 
GPA Bam -Ga hybrids, GPA Bam -Gas and GPA Bam -Gai2, described in this application were 
prepared in an analogous manner using Cadusl606 as the parental plasmid for the 
construction of the GPA Bam -Ga i2 hybrid and Cadus 1181 as the parental plasmid for the 
construction of the GPA Bam -Ga s hybrid. 

Coupling by chimeric Ga proteins. The Ga chimeras described above were tested for the 
ability to couple a mammalian G protein-coupled receptor to the pheromone response 
pathway in yeast. The results of these experiments are outlined in Table 3. Results obtained 
using GPA1 4I -Gai2 to couple the human C5a receptor to the pheromone response pathway in 
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autocrine strains of yeast are disclosed in above. 

Example 8: Screening f r M dulat rs f G-alpha activity 

Screens for modulators of Got activity may also be performed as shown in the 
following examples for illustration purposes, which are intended to be non-limiting. 

Strains CY4874 and CY4877 are isogenic but for the presence of Q205L mutation in 
the cloned Gcc i2 gene cloned into plasmid 1. Strains CY4901 and CY4904 each have a 
chromosomally integrated chimeric Get fusion comprising 41 amino acids of gpal at the N 
terminus of the human Ga i2 gene and are isogenic but for the presence of a constitutively 
activating mutation in the C5a receptor gene of CY4901. Strain CY5058 is a gpal mutant 
which carries only the yeast Gpy subunits and no Got subunit This strain is a control strain 
to demonstrate specificity of action on the Got subunit. 
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I. Suppressi nofActivati nbyMutati n of Get 

The Q205L mutation is a constitutively activated GTPase deficient mutant of the human Ga i2 
gene. Antagonist compounds, chemicals or other substances which act on Gcc i2 can be 
recognized by their action to reduce the level of activation and thus reduce the signal from 
the fusl-lacZ reporter gene on the second plasmid (Plasmid 2). 

A. GTPase Gct i2 Mutants 

test component = gpa 41 -Ga i2 (Q 205 L) 
control component = gpa 41 -Ga i2 

As well as the CY4874 and CY4877 constructs detailed above, similar strains with fusl-His3 
or fus2-CAN-l growth readouts may also be used. The fusl-His3 strains are preferred for 
screening for agonists and the fus2-CANl strains are preferred for antagonist screens. 


Readout 

fusl-HIS3 

fusl-lacZ 
fus2-CANl 


test 

strain 

CY4868 


CY4874 
CY4892 


x i2 antagonist 

inhibit growth of -HIS 
+AT (Aminotriazole) 

reduce p-gal activity 

induce growth on 
canavanine 


control 

strain 

CY4871 


CY4877 
CY4386 


In each case an antagonist should cause the test strain to 
behave more like the control strain. 


Readout 

fusl-HIS3 

fusl-lacZ 

fus2-CANl 


B. GTPase Ga s Mutants (Get Specificity) 

test component = Ga s (Q 227 L) 
control component = Ga s 
lest effect of Gcl i2 antagonist control 

Strain strain 

CY4880 none CY4883 

CY4886 none CY4889 

CY4895 none CY4898 


In each case a non-specific antagonist would cause the test strain to behave more like the 

control strain. 


Additional media requirements: -TRP for Get plasmid maintenance in fusl-HIS3 and 
fus2-CANl screens and -TRP -URA for Ga and fusl-lacZ plasmid maintenance in fusl-lacZ 
screen. 


II. Suppression of Activation by Receptors 

Constitutively Activated C5a Receptors 

test component = C5aR* (P, 84 L, activated C5a Receptor) 
control component = C5aR 

The C5aR* mutation has a Leucine residue in place of the Proline residue of the wild-type at 
position 1 84 of the amino acid sequence. 


R eadout 
fusl-HIS3 

fusl-lacZ 
fus2-CANl 


test 

strain 

CY4029 


CY4901 
CY4365 


x i2 antagonist 

inhibit growth of -HIS 
+AT (Aminotriazole) 

reduce p-gal activity 

induce growth on 
canavanine 


control 

strain 

CY2246 


CY4904 
CY4362 


In each case an antagonist should cause the test strain to 
behave more like the control strain. 


Additional media requirements: -LEU for receptor plasmid maintenance in fusl-HIS3 and 
fus2-CANl screens and -LEU-URA for receptor and fusl-lacZ plasmid maintenance in 
fusl-lacZ screen, non-buffered yeast media (pH 5.5). 


Example 9: Identification of a surrogate ligand using expression of a random peptide 
library in yeast expressing an orphan mammalian receptor 

FPRL-1 (formyl peptide receptor-like 1) is a structural homolog of the formyl peptide 
receptor (FPR). FPR is a G protein-coupled receptor, expressed on neutrophils and 
phagocytic cells, that is stimulated by N-formyl peptides of bacterial origin. Specific binding 
of the natural ligand, f-Met-Leu-Phe, stimulates transduction of a signal to mobilize calcium, 
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resulting in cellular changes including chemotaxis and the release of granule contents. Low 
stringency hybridization of HL60 cDNA libraries with an FPR cDNA probe permitted the 
identification of the related receptor, FPRL-1 (Murphy et al. supra; Ye et al. supra). The 
FPRL-1 cDNA encodes a 351 amino acid protein with 69% sequence homology to FPR 
(Murphy et al. supra) FPR and FPRL-1 were found to co-localize to human chromosome 19 
and to have a tissue expression pattern identical to that of FPR, i.e., expression is restricted to 
cells of myeloid origin (Murphy et al. supra). Ye et al. (supra) demonstrated weak binding 
of f-Met-Leu-Phe (uM concentrations) to fibroblasts transfected with FPRL-1 cDNA. In 
contrast, Murphy et al. (supra) could not detect binding of N-formyl peptides to Xenopus 
oocytes transfected with FPRL-1 cDNA. FPRL-1 appears to be an orphan receptor whose 
specific ligand differs from the formyl peptide ligands to which FPR responds. 

In this example experiments detailing the following will be described: (1) 
establishment of a strain of yeast designed to express the human orphan G protein-coupled 
receptor FPRL-1; (2) expression of a random peptide library in the aforementioned strain of 
yeast; and (3) activation of the endogenous yeast pheromone pathway upon stimulation of the 
FPRL-1 receptor by a peptide encoded by a random library expressed within the same strain 
of yeast. 

Preparation of FPRL-1 Yeast Expression Vector 

A plasmid, pFPRLl-L31, containing a 2.6 kb EcoRI-Xhol fragment encoding the FPRL-1 
cDNA in the BluescriptIISK+ vector was obtained from Philip Murphy (N1H). The sequence 
encoding FPRL1 was amplified by the polymerase chain reaction using VENT polymerase 
(New England Biolabs, Inc., Beverly, MA) through 20 cycles and the following 
oligonucleotide primers: 

#1 5' GGCGCCCGGICICCCATGG AAACCAACTTCTCC ACT 

#2 5* GGCGCCCGGICICCGATCCCATTGCCTGTAACTCAGTCTC 

The PCR product was purified, restricted with Bsal and cloned into Cadus 1651 (plPBX-1), a 
PGK promoter-driven expression vector, using Ncol and BamHI sites, to yield CADUS 
2311. The sequence of the entire insert was determined and found to be identical to the 
FPRL-1 sequence deposited in GenBank (accession number M84562). 

Preparation of Random Oligonucleotides 

Library-Recycling Protocol to Identify a Surrogate Ligand 

The yeast strain CY1141 (MATalpha farl*1441 tbtl-1 fusl-HIS3 can 1 
stel4::trpl:;LYS2 ste3*1156 gpal(41)-Galphai2 lys2 ura3 leu2 trpl his3) was used in the 
experiments that follow. CY1141 contains a pheromone inducible HIS3 gene, fusl-HIS3 
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integrated at the FUS1 locus and a hybrid gene encoding the first 41 amino acids of GPA1 
(yeast G alpha) fused to sequence encoding human G alphai2 (lacking codons encoding the 
N-terminal 33 amino acids) replacing GPA1 at its chromosomal locus. The yeast STEM 
gene is disrupted to lower the basal level of signaling through the pheromone response 
pathway. The yeast a-factor receptor gene, STE3, is deleted. CY1 141 was transformed with 
Cadus 23 1 1 to yield CY6571, a strain expressing the human orphan receptor, FPRL-1. 

CY6571 exhibited LIRMA (ligand independent receptor mediated activation), that is, 
activation of the yeast pheromone pathway in the absence of ligand. It was determined that 
the yeast growth on selective media that resulted from LIRMA was eliminated by the 
additional of 2.5millimolar concentrations of 3-aminotriazole (AT). AT is an inhibitor of the 
HIS3 gene product that serves to reduce background growth. Therefore, selection protocols 
aimed at the identification of surrogate ligands for the FPRL-1 receptor were carried out at 
this concentration of AT. 

CY6571 was inoculated to 10 mis of standard synthetic media (SD) lacking leucine (- 
Leu) and incubated overnight at 30°C. The 10 ml overnight culture was used to inoculate 50 
mis of YEPD; this culture was incubated at 30°C for 4.5-5 hours at which time the cells were 
harvested and prepared for transformation with DNA encoding a random peptide library 
[alpha-NNK (6.24.94)] encoding tridecapeptides of random sequence, by electroporation. 
Post electroporation (in 0.2 cm cuvettes, 0.25 uF, 200Q, 1.5 kV) the cells were immediately 
diluted in 1 ml ice-cold 1M sorbitol and lOOuL aliquots were placed onto 10 synthetic media 
plates (pH6.8) lacking leucine and uracil (-Leu-Ura). The plates were incubated at 30°C for 
2-4 days at which time two replicas of each original transformation plate were made to 
synthetic media (pH6.8) lacking leucine, uracil and histidirie and supplemented with 2.5mM 
AT(-Leu-Ura-His+2.5mM AT). The replicas were incubated at 30°C for 3-5 days. Post 
incubation the colonies present on the replica sets of two were scraped from the plates into a 
total of 10 mis of H2O (5 mis each plate). The OD^qq of each cell suspension was 
determined and crude plasmid isolations were done on 8-16 OD units of cells for each pool. 
A total of eight pools resulted, due to lower numbers of yeast colonies present in four sets of 
plates. The pellets obtained from these crude plasmid isolations (the so called "smash and 
grab" technique, Methods in Yeast Genetics - A Laboratory Manual, 1990, M.D. Rose, F. 
Winston and P. Heiler. Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y.), 
were resuspended in 40uL of 10 mM Tris, 1 mM EDTA, pH8.0 and luL was used to 
transform E. coli by electroporation (0.1 cm cuvettes, 0.25 up, 200Q, 1.8 kV). Post 
electroporation the cells were immediately diluted into 1 ml 2XYT media and incubated, 
with shaking, at 37°C for 30 minutes after which time the cells were used to inoculate 50 mis 
of 2xYT supplemented with 100 ug/ml ampicillin. The 10 resulting cultures were incubated 
at 37°C overnight. Plasmid DNA was isolated from each of these bacteria cultures using 
Qiagen columns (Qiagen, Inc., Chatsworth, CA)). Each plasmid DNA pellet was 


resuspended in 50uL Tris lOmM, EDTA 1 mM, pH 8.0. 

Strain CY6571 was transformed with luL of each plasmid pool by electroporation. 
Post electroporation the cells were diluted into 400uL 1M sorbitol. From each electroporated 
cell suspension, luL and 400uL of cells were plated on -Leu-Ura synthetic media, pH6.8 to 
yield "low density" and "high density" platings. The plates were incubated at 30°C for 3 
days, at which time replicas of both the low and high density plates were made to -Leu-Ura- 
His+2.5mM AT. For those cases where enrichment for a plasmid capable of conferring a 
His+ phenotype had occurred, this would be reflected by an amplified number of His+ 
colonies on both the low and high density plates visible at days 2-3, although the 
amplification would be most obvious on the plates that had received a high density of cells. 
In the FPRL-1 experiment 1/8 pools showed amplification of His+ colonies. The cells were 
scraped from this plate into 5 mis of H2O, the OD^QO °f me ce ^ suspension was determined 
and a crude plasmid isolation was done on 1 5 OD units of yeast cells. The pellet obtained 
was resuspended in 40uL lOmM Tris, 1 mM EDTA, pH8.0 and luL was used to transform 
E. coli. Plasmid DNA was isolated by miniprep from 3 ml 2XYT cultures of single bacterial 
colonies resulting from this transformation. 10 DNA pellets (Al through A 10) deriving from 
individual bacterial colonies were resuspended in 20uL 10 mM Tris 1 mM EDTA, pH8.0 and 
used to transform CY6571 (containing the FPRL-1 expression vector) and CY6263 (CY1 141 
containing a control expression vector lacking any receptor sequence) by electroporation. 
Cadus 1625, a control vector lacking sequences encoding a peptide, was included and used to 
transform both the receptor-!- and receptor- strains of yeast. Transformants were first selected 
on -Leu-Ura, pH6.8 then three yeast transformants of each type (from 11 CY6571 
transformations and 1 1 CY6263 transformations) were patched to -Leu-Ura, pH6.8 to expand 
the colonies. Once expanded, streaks of the transformants were made on -Leu-Ura- 
His+2.5mM AT to test for growth in the absence of histidine. All piasmids except the one 
denoted A2 conferred a growth advantage on media lacking histidine to yeast bearing the 
FPRL-1 -encoding plasmid but not to yeast lacking the receptor plasmid. The peptide 
sequence found to be encoded by piasmids Al and A3-A10 is: 
Serl^uLeuTrpLeuThrCysArgProTrpGluAlaMet, and is encoded by the nucleotide sequence 
5'-TCT CTG CTT TGG CTG ACT TGT CGG CCT TGG GAG GCG ATG-3'. 

Activation of the Pheromone Response Pathway in Yeas t Expressing the FPRL-1 Receptor 
and Peptide Agonist. 

For verificatiin of pheromone pathway activation and quantification of the 
stimulation, the activity of the fusl promoter was determined colorimetrically using a fusl- 
lacZ fusion in a parallel set of test strains. CY1141, described above, was used as the 
recipient strain for these experiments. Transformants contained CADUS 1584 (pRS424- 
fusl-lacZ) in addition to receptor (R +/_ ) and ligand (L +/ -) piasmids. Four strains (bearing the 
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identical plasmids) were grown overnight in minimal media lacking leucine, uracil, and 
tryptophan, pH8.6. The overnight cultures were used to inoculate -Leu -Ura -Trp pH6.8 
media and these new cultures were grown for approximately 4.5-5 hours to an OD6 0 o of less 
than 0.4. Assay of p-galactosidase activity (Guarente 1983) in cells from these cultures 
yielded the following results: 

CY1 141/CADUS 231 1/peptide Al/CADUS 1584 R+L + 28 units 

CY1 141/CADUS 231 1/CADUS 1625/CADUS 1584 R + L~ 3 units 

CY1 141/CADUS 1289/peptide Al/CADUS 1584 R"L + 3.5 units 

CY1 141/CADUS 1289/CADUS 1625/CADUS 1584 R"L _ 3.9 units 

The presence of receptor and peptide-encoding plasmids resulted in an average 8-fold 
stimulation over background levels of p-galactosidase. 


Example 10: Identification of surrogate ligands using expression of a random peptide 
library in yeast expressing the orphan mammalian receptor, MDR-15. 

In a similar manner a plasmid encoding the monocyte derived receptor monocyte- 
derived receptor 15 (MDR15; Barella et al. (1995) Biochem. J. 309:773-9) was used to 
construct a yeast strain (CY6573) expressing this receptor. This receptor is an alternative 
spliced form of the Burkitt's lymphoma receptor 1 (BLR1) encoded by a human Burkitt's 
lymphoma cDNA (Dobner et al. (1992) Eur. J. Immunol 22, 2795-2799). Strain CY6573 
was transformed in a similar manner with the NNK13 library, and, following selection on ten 
-Leu-Ura (4.4 x 10 5 colonies per plate), replica plated to -Leu-Ura-His+ ImM AT plates. 
Upon reisolation of plasmid pools and re-transformation into strain CY6573; eight of ten 
pools showed signicantly enriched colony formation on -Leu-Ura-His+ ImM AT plates. 
Eight unique plasmids derived from these pools when retransformed into CY6573 conferred 
growth on -Leu-Ura-His+ ImM AT plates. One of these plasmids failed to confer growth in a 
yeast strain lacking the MDR1 5 receptor. 

Example 11; Identification of a ligand using expression of a random peptide library in 
yeast expressing the human thrombin receptor 

The receptor for thrombin, a G protein-coupled receptor, is present on numerous cell 
types including platelets, vascular smooth muscle, fibroblasts and on a subset of cells that 
function in immunity. Thrombin, a serine protease, binds to and cleaves the receptor 
molecule at residue 41, generating a new receptor N-terminus. The post-cleavage N-terminal 
residues then act as a "tethered ligand' to activate the receptor molecule (Vu et al. 1994). In 
platelets, signaling through the thrombin receptor has been shown to result in numerous 
effects including stimulation of phospholipase C, mobilization of intracellular Ca 2+ and 
inhibition of adenylyl cyclase. 


In this example experiments that detail the following will be described (1) 
establishment of a strain of yeast designed to express the human G protein-coupled receptor 
for thrombin; (2) expression of a random peptide library in the afore-mentioned strain of 
yeast and (3) activation of the endogenous yeast pheromone pathway upon stimulation of the 
thrombin receptor by peptides encoded by a random library expressed within the same strain 
of yeast. 

Preparation of a Yeast Expression Vector for a Mammalian Thrombin Receptor 

The human thrombin receptor was amplified by PCR from pcDNA3:Hu-Thr9b-5' 
(Bristol Myers Squibb) using the following oligonucleotides: 

5' GGGCCATGGGGCCGCGGCGGTTG 3* 

5' CCCGGATCCTAAGTTAACAGCTTTTTGTATAT 3' 

The amplified product was purified by gel electrophoresis, restricted with Ncol and BamHI 
and ligated to Ncol and BamHI-cut CADUS 1871, a PGK promoter-driven expression vector, 
to yield CADUS 2260. Cloning into CADUS 1871 introduces a novel stop codon preceded 
by the triplet GlySerVal after the authentic carboxy terminal codon of the human thrombin 
receptor (threonine). In addition, an invertase signal sequence is fused to the authentic amino 
terminus of the receptor. 

CY7467 exhibited LIRMA (ligand independent receptor mediated activation), that is, 
activation of the yeast pheromone pathway in the absence of ligand. It was determined that 
the yeast growth on selective media that resulted from LIRMA was eliminated by the 
addition of 2.5 millimolar concentrations of 3-aminotriazole (AT). AT is an inhibitor of the 
HIS3 gene product that serves to reduce background growth. Therefore, selection protocols 
aimed at the identification of novel peptide ligands for the human thrombin receptor were 
carried out at this concentration of AT. 

Preparation Of ftandom Oligonucleotide Library 
As described above. 

Recycling Protocol to Identify a S urrogate Ligand 

The yeast strain CY1141 (MATalpha farl*1442 tbtl-1 fusl-HIS3 canl 
stel4::trpl::LYS2 ste3*1156 gpal(41)-Galphai2 lys2 ura3 leu2 trpl his3) was transformed 
with CADUS 2260 to yield strain CY7467, expressing the human thrombin receptor. 
CY7467 was inoculated to 10 mis of standard synthetic media (SD) lacking leucine (-Leu) 
and incubated overnight at 30 C. The 10 ml overnight culture was used to inoculate 50 mis 
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of YEPD media; this culture was incubated at 30 C for 4.5-5 hours at which time the cells 
were harvested and prepared for transformation with DNA encoding a random peptide library 
[alpha-NNK (6.24.94)] by electroporation. Post electroporation (in 0.2 cm cuvettes, 0.25 mF, 
200 W, 1.5 kV) the cells were immediately diluted in 1 ml ice-cold 1M sorbitol and lOOmL 
aliquots were plated onto 10 synthetic media plates (pH6. 8) lacking leucine and uracil (-Leu- 
Ura). The plates were incubated at 30 C for 2-4 days at which time two replicas of each 
original transformation plate were made to synthetic media (pH6.8) lacking leucine, uracil 
and histidine and supplemented with 2.5mM AT(-Leu-Ura-His+ 2.5mM AT). The replicas 
were incubated at 30 C for 3-5 days. Post incubation the colonies present on the replica sets 
of two were scraped from the plates into a total of 10 mis of H20 (5 mis each plate). The 
OD^oo of each cell suspension was determined and crude plasmid isolations were done on 8- 
16 OD units of cells for each pool. A total of ten pools resulted. The pellets obtained from 
these crude plasmid isolations were resuspended in 40mL of 10 mM Tris, 1 mM EDTA, 
pH8.0 and lmL was used to transform E. coli by electroporation (0.1 cm cuvettes, 0.25 mF, 
200W, 1.8 kV). Post electroporation the cells were immediately diluted into 1 ml 2XYT 
media and incubated, with shaking, at 37 C for 30 minutes after which time the cells were 
used to inoculate 50 mis of 2xYT supplemented with 100 ug/ml ampicillin. The 10 resulting 
cultures were incubated at 37 C overnight. Plasmid DNA was isolated from each of these 
bacterial cultures using Qiagen columns (Qiagen, Inc., Chatsworth, CA). Each plasmid DNA 
pellet was resuspended in 50mL Tris lOmM, EDTA 1 mM, pH 8.0. 

Strain CY7467 was transformed with lmL of each plasmid pool by electroporation. 
Post electroporation the cells were diluted into 400mL 1M sorbitol. From each 
electroporated cell suspension, lmL and 400mL of cells were plated on -Leu-Ura synthetic 
media, pH6.8 to yield "low density" and "high density" platings. The plates were incubated 
at 30 C for 3 days, at which time replicas of both the low and high density plates were made 
to -Leu-Ura-His+ 2.5mM AT. For those cases where enrichment for a plasmid capable of 
conferring a His+ phenotype had occurred, this would be reflected by an amplified number of 
His+ colonies on both the low and high density plates visible at days 2-3, although the 
amplification would be most obvious on the plates that had received a high density of cells. 
In this experiment 3/10 pools showed amplification of His+ colonies. The cells from each of 
these plates were scraped into 5 mis of H2O, the OD^QO °f me ce ^ suspensions were 
determined and crude plasmid isolations were done on 8-16 OD units of yeast cells. The 
pellets obtained were resuspended in 40mL 10 mM Tris, 1 mM EDTA, pH8.0 and lmL was 
used to transform E. coli. Plasmid DNA was isolated by miniprep from 3 ml 2XYT cultures 
of single bacterial colonies resulting from these transformations (three bacterial colonies for 
each DNA pool were processed in this way). DNAs deriving from three individual bacterial 
colonies per pool were resuspended in 20mL 10 mM Tris 1 mM EDTA, pH8.0. The three 
DNAs derived per pool were sequenced and found to encode identical peptides. Thus three 



differing DNA sequences were derived, one representing each amplified pool. One plasmid 
representing each of the three original amplified pools was used to transform CY7467 
(containing the thrombin receptor expression vector) and GY6263 (CY1141 containing a 
control expression vector lacking any receptor sequence) by electroporation. CADUS 1625, 
a control vector lacking sequences encoding a peptide was included and used to transform 
both the receptorf and receptor- strains of yeast. CADUS 1651, a control vector lacking 
sequences encoding a receptor included and used to transform both the ligand+ and ligand- 
strains of yeast.. Transformants were first selected on -Leu-Ura, pH6.8, then two yeast 
transformants of each type were patched to -Leu-Ura, pH6.8 to expand the colonies. Once 
expanded, streaks of the transformants were made on -Leu-Ura-His+ 2.5mM AT to test for 
growth in the absence of histidine. One of the three plasmids tested conferred a growth 
advantage on media lacking histidine to yeast bearing the thrombin-encoding plasmid but not 
to yeast lacking the receptor plasmid. The peptide sequence encoded by this plasmid is: Val- 
Cys-Pro-AIa-Arg-Tyr-Val-Leu-Pro-Gly-Pro-Val-Leu and was encoded by the nucleotide 
sequence GTT TGT CCT GCG CGT TAT GTG CTG CCT GGG CCT GTT TTG. 


Table i. Detection of C5a production in 


_ R - L - R + L-. P-T.4. 


yeast by ELISA. 


CC5a] in cuiture n. d .~7^r^ 

tcsa] released n d n d n =6 ° «" 

■ °- 8 n 9/ml 0.6 no/ml 

from lysed cells* =97 ^ ^ /ml 

CSa was detected by enzyme- linked immunosorbent assay (Hl ISA> 

* Determined by nellptinn r^i i ^ 

y pelleting cells, resuspending cells in t-hp 
crxgxnax volume , breaking yeasfc with ^ ^ * * - *e 

resulting supernatant. g 
n.d.=not done 


Table 2. 
Chimera 

GPAl 41 -Go:i2 


Coupling of th C5a receptor Q Ga chimer 

ExDress nn _ yeasc 


GPAl 41 -Gori3 


GPAl/?am-Gai2 


GPAi/?am-Gc*16 


GPAl^am-Gofs 


Ex pression 
Confcexh 
single copy, 
integrated, 
GPA1 promoter 
single copy, 
integrated, 
GPAl promoter 

low copy plasmid, 
GPA 1 promoter 

low copy plasmid, 
GPAl p romoter 


low copy plasmid, 
GPAl promoter 


Result 

Good signal to noise ratio. - 
efficient coupling to yeast 

Poor signal to noise ratio: 
high background due to poor 
coupling to yeast 0y, high 
LIRMA* . 

Signal equal to that with 
GPAl 4I -Gai2, however, back- 
ground is greater. 
Poor signal to noise ratio, 
high background due to poor 
coupling to yeast 0y , high 
LIRMA*. • • 

""acceptably high background 
due to poor coupling to 
yeast 0- . hign LIRMA*. 
*LIRMA = L igand Xndependent g eceDtor Me(U . 

thxe phenomenon, there is an increas^e " ^ tlVatl ° n - 
media for strains containino hpt _ ln gr ° Wth °" selective 

r — xt is pos :::™^z^t: in the ^ 

decrease MMa . It has been notgd antagonists would 

T Cl£iC "^iats reduce LIRMA of the T 2 ' adr l ^ " ." 94) ^ 
when that receptor is overexpres ~ed • , adrener 9 lc receptor 

— ^ be exploi :;rr se ;; ra r ns9enic mice - 

identification of antagonists each includi »9 the 

- subset of antagonist wo " ^ c ?Jt^f 
conformation in such a _*Pected to affect the receptor 

signaling that occurs in the ah " ^ d0wnst «am 

exploited to identify new a TrT" ^ ^ 

expressing CDNA clon es L yeast s^ra ns ° UPled " C ^ t ~» 

G Proteins which couple only Jlf . eXpressi »9 tho. e chimeric 

lirma may permit thVi^Lrl L r st Py - In additio - 

specific for G proteins dentlfl ^ tl0n ° f inhibitors that are 


Table 3.. Coupling of Got switch region hybrids to the 
pheromone response pathway. 


Protein 

GPA1 amino 
acid sequences 

Gas amino acid 
sequences 


GPA1 

1-472 

none 

Couples with 
Gpy 

GaS 

none 

1-394 

Couples with 
Gfly weakly 

GPA^-S 

1-41 

42-394 

Couples with 
Gpy weakly 

SGS 

297-333 

1-201 + 237- 
394 . 

Does not 
couple with 

GPA 4I -SGS 

xl-41 + 297-333 

42-201 + 237- 
394 

couples with 
GjSy weakly 


